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HEME PROTEINS HEMAT-/7S AND HEMAT-B5 AND THEIR USE IN 
MEDICINE AND MICROSENSORS 



The subject matter of this application was made with support from the United 
States Government under Grant No. MSB960086 from the National Science Foundation. 
The United States Government may retain certain rights. 

BACKGROUND OF THE INVENTION 

Heme proteins such as hemoglobin and myoglobin play an essential role in 
stabilizing molecular oxygen for transport and storage. The oxygen carrying portion of 
the red blood cell is hemoglobin, a tetrameric protein molecule composed of two identical 
alpha globins (alpha 1, alpha 2), two identical beta globins (beta 1, beta 2) and four heme 
molecules. A heme molecule is incorporated into each of the alpha and beta globins to 
give alpha and beta subunits. Heme is a macrocyclic organic molecule that contains an 
iron atom at its center; each heme can combine reversibly with one ligand molecule, for 
example oxygen. In a hemoglobin tetramer, each alpha subunit is associated with a beta 
subunit to form two stable alpha/beta dimers, which in turn associate to form the tetramer 
(a homodimer). The subunits are noncovalently associated through Van der Waals forces, 
hydrogen bonds and salt bridges. Ligands, particularly oxygen, bind reversibly to the 
reduced form of the iron (ferrous, Fe 2+ ) in the heme. Other ligands which compete with 
oxygen for the heme group include carbon monoxide and nitric oxide. 

It is not always practical to transfuse a patient with donated blood. The well 
known complications of blood transfusion namely incompatibility reactions, disease 
transmission, immunosuppression and the storage limitations of erythrocytes points to the 
need for the development of blood substitutes devoid of these shortcomings. In these 
situations, use of a red blood cell substitute is necessary. A "blood substitute" is a 
preparation that does not necessarily replace blood in all of its functions, but an 
emergency resuscitative fluid that is capable of efficiently transporting oxygen to tissue. 
This fluid, however, must be free of toxic side-effects, as well as of agents of disease such 
as bacteria and viruses. 

For over 50 years, efforts directed to the development of a blood substitute have 
focused on hemoglobin (Hb). Hemoglobin (Hgb) is the oxygen-carrying component of 
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blood. Hemoglobin circulates through the bloodstream inside small enucleate cells called 
erythrocytes (red blood cells). Hemoglobin is a protein constructed from four associated 
polypeptide chains, and bearing prosthetic groups known as hemes. The erythrocyte helps 
maintain hemoglobin in its reduced, functional form. The heme iron atom is labile to 
5 oxidation, but may be reduced again by one of two enzyme systems within the 
erythrocyte, the cytochrome b5 and glutathione reduction systems. 

Hemoglobin exhibits cooperative binding of oxygen by the four subunits of the 
hemoglobin molecule (two alpha-globins and two beta-globins in the case of HbA), and 
this cooperativity greatly facilitates efficient oxygen transport. Cooperativity, achieved 

10 by the so-called heme-heme interaction, allows hemoglobin to vary its affinity for 

oxygen. Hemoglobin reversibly binds up to four moles of oxygen per mole of Hb. At 
high oxygen concentration, such as that found in the lungs, the oxygen affinity is high and 
hemoglobin is almost saturated with oxygen. At low oxygen concentration, such as that 
found in actively respiring tissue, the oxygen affinity is lowered and oxygen is unloaded. 

1 5 The oxygen affinity of hemoglobin is lowered by the presence of 2,3 -diphosphoglycerate 
(2,3-DPG), chloride ions and hydrogen ions. Respiring tissue releases carbon dioxide into 
the blood and lowers its pH (i.e. increases the hydrogen ion concentration), thereby 
causing oxygen to dissociate from hemoglobin and allowing it to diffuse into individual 
cells. 

20 The ability of hemoglobin to alter its oxygen affinity, increasing the efficiency of 

oxygen transport around the body, is dependent on the presence of the metabolite 2,3- 
DPG. Inside the erythrocyte 2,3-DPG is present at a concentration nearly as great as that 
of hemoglobin itself. In the absence of 2,3-DPG "conventional" hemoglobin binds oxygen 
very tightly and would release little oxygen to respiring tissue. 

25 Aging erythrocytes release small amounts of free hemoglobin into the blood 

plasma where it is rapidly bound by the scavenging protein haptoglobin. The 
hemoglobin-haptoglobin complex is removed from the blood and degraded by the spleen 
and liver. 

It is clear from the above considerations that free native hemoglobin A, injected 
30 directly into the bloodstream, would not support efficient oxygen transport about the 

body. The essential allosteric regulator 2,3-DPG is not present in sufficient concentration 
in the plasma to allow hemoglobin to release much oxygen at venous oxygen tension, and 



free hemoglobin would be rapidly inactivated as an oxygen carrier by auto-oxidation of 
the heme iron. 

Therefore, a need exists for a substitute other than hemoglobin which can bind and 
carry oxygen to cells. This substitute may also be used in other applications where 
5 hemoglobin is used, including as a biological sensor for oxygen. The present invention 
provides proteins which meet that need. 

SUMMARY OF THE INVENTION 

10 The present invention provides isolated archaeal and bacterial heme binding 

proteins which reversibly bind oxygen with a low affinity. 

The invention also provides a blood substitute containing the bacterial heme 

binding protein which reversibly binds oxygen with a low affinity. 

Another embodiment of the invention is a method for controlled storage of 
1 5 oxygen. A bacterial heme binding protein which reversibly binds oxygen with a low 

affinity is contacted with oxygen allowing the protein to bind and store oxygen. 

The invention also provides a method of sensing gaseous ligands. A heme binding 

bacterial protein is exposed to a test sample and a change in the conformation of the 

protein is measured. 

20 Yet another embodiment of the invention is a chimeric protein having a heme- 

binding domain and a heterologous signaling domain. 

The invention further provides an isolated nucleic acid molecule which encodes a 
heme binding bacterial protein that reversibly binds oxygen with a low affinity. 

25 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the conserved sequences within HemAT-Hs, HemAT-^s, and 
sperm-whale myoglobin (SWMb). Black boxes indicate positions at which the residues 
are identical, and gray boxes highlight residues that are similar. Sequences were aligned 
30 using the Clustal program of the MegAlign/DNASTAR package. A) Alignment of the 
amino-terminal domain of HemAT-ife, HemAT-Ss, and SWMb. Helical regions in 
SWMb (helices A-H) (B. C. Clothia, et al., J. Mol. Biol. 196:199 (1987); S. N. 
Vinogradov et al., Comp. Biochem. Physiol. 106B:1 (1993), which are hereby 



-4- 



incorporated by reference) are delineated by dotted arrows. Pro (P), Phe (F), and His (H) 
residues in SWMb that are highly conserved among all globins are marked with asterisks. 
B) Alignment of the carboxyl-terminal domains of HemAT-/£y, HemAT-ifo, and Tsr (B. 
K. Kendall, et al., Nature 301 :623 (1983); G. L. Hazelbauer, Curr. Qpin. Struct. Biol. , 
5 2:505 (1992), which are hereby incorporated by reference). 

Figure 2 is a characterization of HemAT proteins. Figure 2A shows the purified 
HemAT-Hs and HemAT-ite in 10% SDS-PAGE. Approximately 5 p.g of purified protein 
were loaded in each lane for separation during SDS-PAGE in 10% acrylamide (M. Alam 
et al., J. BacterioL 173:5837 (1991), which is hereby incorporated by reference). Lane 1, 

1 0 HemAT-tfs; lane 2, HemATfeHis-i*; and lane 3, HemAT-Bs. The MW markers (kDa) 
are shown at the left. Figure 2B is a fluorograph and immunoblot of HemAT-Hs'. 
Radiolabeling and immunoblotting were performed as previously described (M. Alam et 
al., J. Bacteriol. . 173:5837 (1991), which is hereby incorporated by reference). Lane 1, 
fluorograph of proteins from the AhemAT-Hs; lane 2, fluorograph of proteins from the 

1 5 AhemA T-HslhemA T-Hs++ strain (A. Brooun, Ph.D thesis. University of Hawaii, Hawaii 
(1997), which is hereby incorporated by reference). Ndel waAXbal restriction sites were 
used to clone the hemAT-Hs gene into the shuttle vector pKJ427. Primers introducing 
flanking Ndel saidXbal restriction sites were used for PCR amplification. The PCR 
product was initially cloned into the pCRi?-Blunt II TOPO cloning vector and later 

20 subcloned into plasmid pKJ427 after digestion with Ndel and Xbal. The resulting plasmid 
was introduced into the AhemAT-Hs strain. Lane 1 : immunoblot of AhemAT-Hs strain; 
and lane 2: immunoblot of AhemA T-Hs/hemA T-Hs++ strain using anti-transducer peptide 
antibody (W. Zhang et al., Proc. Natl. Acad. Sci. U.S.A. 93:4649 (1996), which is hereby 
incorporated by reference). Bars indicate the positions of molecular weight markers 

25 (kDa). Radiolabeling and immunoblot experiments were performed according to Alam & 
Hazelbauer (Alam et al., J. Bacterid. , 173:5837-5842 (1991)). 

Figure 3 provides a comparison of the proteins used in the homology analyses. 
Ml and M2 are the site of myoglobin recognition. M2 is the site of HemAT recognition. 
The H-box is the primary site of microbial hemoglobin recognition. 

30 Figure 4 shows absorption spectra of purified HemAT-iiy, HemAT-ifc, and horse- 

heart myoglobin (HHMb). Panel A shows oxygenated forms of purified HemAT-i2s', 
HemAT-ik, and oxymyoglobin. Panel B shows deoxygenated forms of UemAT-Hs, 



HemAT-Bs, and myoglobin. Panel C shows CO-bound forms of HemAT-ffc, HemAT- 
Bs, and myoglobin. Panel D shows reoxidized forms of HemAT-Hs, HemAT-ifo. 
Samples concentrations are approximately 20 uM in heme. Deoxygenated samples were 
prepared by the addition of sodium dithionite to the deaerated protein solutions. 
5 Figure 5 shows aerotactic responses in H. salinarum and B. subtilis. Panel A 

shows H. salinarum strain Flxl5 (HtrVIII and HemAT-/fr present), and mutant strains 
AhemAT-Hs (HtrVIII present) and AhtrVIII (HemAT-Tft present). Panel B shows wild- 
type B. subtilis strain OI1085 and mutant strains OI3545 (Aten) and OI3555 
(overexpression of hemAT-Bs in Aten). All cells were grown to mid-logarithmic phase. 

10 Microcapillaries (internal dimension 100x10 urn) were filled halfway with cell 

suspension. The capillaries were sealed at both ends and placed on a microscope stage 
prewarmed to 35-37 °C. Time-lapse, dark-field microscopic images were recorded using 
a video-digitized camera linked to a computer. The images shown were taken at 180 min 
for H. salinarum and at 30 min for B. subtilis. The meniscus at the air interface is visible 

15 to the right in each image. 

Figure 6 provides transient absorption data subsequent to CO photolysis obtained 
at 430 nm at 25°C and 1 atm CO for UemAT-Hs (solid line) and HemAT-5s (dotted line). 
Samples were approximately 20 uM. The traces are the average of 50 laser pulses (532 
nm exciation, 7 ns pulse width, 10 mJ/pulse). 

20 Figure 7 shows the transient difference spectrum (25 [is subsequent to photolysis) 

overlaid with the equilibrium difference spectrum (deoxy minus CO-bound) for HemAT- 
Hs (top panel) and HemAT-fe (bottom panel). Sample conditions are as described in 
Figure 4. 

Figure 8 provides CO-off rate data for HemAT-/fo (dashed line), HemAT-Bs 
25 (dotted line), and horse heart Mb (solid line). Changes in absorbance as a function of 
time at 418 were monitored after the addition of potassium ferricyanide (final 
concentration of 1 .5 mM). See Example 17 for details. 
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DET AILED DESCRIPTION OF THE INVENTION 

The present invention provides an isolated bacterial myoglobin-like heme binding 
protein which reversibly binds oxygen with a low affinity. These proteins are a new class 
5 of heme proteins that bind diatomic oxygen through their prosthetic group and trigger 
negative aerotactic responses. RemAT-Hs and HemAT-fe are the first myoglobin-like 
heme proteins in the Archaea and in Bacteria, respectively. Purified HemAT-f/s and 
HemAT-fe exhibit spectral properties similar to oxygen-bound myoglobin. 
Deoxygenation of either protein results in absorption shifts similar to those observed for 
10 deoxymyoglobin. The oxy-/deoxy spectral changes in HemAT-Hs and in HemAT-£s are 
completely reversible, a characteristic feature of the heme prosthetic group in myoglobin. 
The C-terminus of both proteins has high homology with the signaling domain of 
bacterial methyl-accepting chemoreceptors and they mediate aerotaxis. By site-directed 
mutagenesis the fifth coordination site of the heme iron was identified in HemAT-ift and 
1 5 HemAT-jBs comparable to myoglobin. 

In a preferred embodiment of the invention, the isolated heme-binding protein has 
both a heme binding domain and a signaling domain. 

When the hemAT-Hs gene was originally cloned, its product was predicted to be a 
soluble signal transducer (W. Zhang et al., Proc. Natl. Acad. Sci. U.S.A. 93:4649 (1996), 
20 which is hereby incorporated by reference). HemAT-#s was identified in the B. subtilis 
genome-sequencing project as the product of an open-reading frame encoding a protein 
with marked similarities to methyl-accepting chemotaxis proteins (MCP) (F. Kunst et al., 
Nature 390:249 (1997), which is hereby incorporated by reference). The predicted 
translation products of the hemAT-Hs and hemAT-Bs genes, comprising 489 and 432 
25 residues, respectively, exhibit two striking features: a) their amino-termini (residues 1- 
184 in HemAT-ifc and 1-175 in HemAT-fe) display limited homology to myoglobin 
(Figure 1A); b) residues 222 to 489 of HemAT-Hs and 198 to 432 of HemAT-fe are 30% 
identical to the cytoplasmic signaling domain of Tsr, an MCP from Escherichia coli 
(Figure IB). 

30 The residues absolutely conserved among all globins are the proximal His in the F 

helix (F8) and Phe in the CD region (CD1) (B. C. Clothia, et al., J. Mol. Biol. 196:199 
(1987); S. N. Vinogradov et al., Comp. Biochem. Physiol. 106B:1 (1993), which are 
hereby incorporated by reference). Highly conserved residues include the distal His in 
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the E helix (E7), Phe in the CD4 region, and Pro at the beginning of the C helix (C2) (B. 
C. Clothia, et al., J. Mol. Biol , 196:199 (1987); S. N. Vinogradov et al., Comp. Biochem. 
Physiol. , 106B:1 (1993), which are hereby incorporated by reference). Three of these 
residues (Pro in C2, Phe in CD1, His in F8) are conserved in both HemAT-Hs and 
5 HemAT-Bs (asterisks in Figure 1 A). These features suggested to us that HemATs may be 
heme-containing proteins that generate signals in response to binding of oxygen. 

Both proteins, HemAT-Hs and RemAT-Bs, can be expressed in E. coli from 
recombinant vectors. PCR primers with sequences flanking the hemAT-Hs gene from H 
salinarum strain Flxl 5 and encoding a Ndel or BamHl restriction site were used to 
1 0 amplify and clone the chromosomal gene into the pET expression vector (Novogen Inc.). 
The PCR product was initially ligated into the pCR R -Blunt II TOPO cloning vector 
(Invitrogen, Inc.) and then subcloned into pET-3b after digestion of the donor and 
recipient plasmids with Ndel and BamHl. The resulting plasmid was introduced into the 
E. coli pLysS strain for protein expression. PCR primers with sequences flanking the 
1 5 hemAT-Bs gene from B. subtilis strain Oil 085 and encoding a BamHl or Pstl restriction 
site were used to amplify and clone the chromosomal gene into the pCR R -Blunt II TOPO 
vector. This fragment was later subcloned into the pMALcII expression vector (New 
England Biolabs, Inc.), which was introduced into the E. coli pLysS strain for protein 
expression. 

20 Recombinant HemAT-Hs is purified using anion-exchange and gel-filtration 

chromatography. BL21 pLysS host cells harboring plasmids carrying the hemAT-Hs or 
hemAT-Bs genes were grown in Luria-Bertani broth with appropriate antibiotics, and 
synthesis of the proteins was induced with 0.6 mM isopropyl-D-thiogalactopyranoside. 
After a two hour induction, the cells were harvested by low speed centrifugation (4000 x 

25 g) at 4°C for 1 5 min. The pellets were resuspended in buffer (50 mM NaCl, 50 mM Tris- 
HC1, pH 6.0) and sonicated for 4 min (12 pulses of 20 sec with 30 sec pauses). The cell 
lysate was centrifuged at 28,000 x g for 20 min. The red supernatant became the source 
of proteins for purification. The HemAT-Hs supernatant was applied to an anion- 
exchange POROS HQ/M column equilibrated with buffer (50 mM Tris-HCl, pH 6.0). A 

30 linear gradient of NaCl (0-1 500 mM) was applied, and HemAT-Hs eluted at about 400 
mM. Fractions containing HemAT-Hs (monitored by the Soret band absorbance at 410 
rrm and SDS-PAGE) were concentrated and applied to a HiLoad Superdex 200 gel- 
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filtration column. Fractions containing HemAT-if.s' were concentrated with an Amicon 
100K concentrator. 

A saturated (NH 4 ) 2 S0 4 solution was added to 30% saturation to the HemAT-Bs 
supernatant and centrifuged at 28,000 x g for 20 min. The optically clear, light-red 
5 supernatant was further fractionated by adding (NH4) 2 S0 4 to 36% saturation, and the 

precipitate was pelleted by centrifugation. The pellet was resuspended in buffer (200 mM 
NaCl, 50 mM Tris-HCl, pH 8.0) and applied to a HiLoad Superdex 75 column. Fractions 
containing HemAT-Bs were concentrated with an Amicon 50K concentrator. 

During SDS-polyacrylamide gel electrophoresis (SDS-PAGE), purified HemAT- 

10 Hs migrates slower than expected from its calculated molecular mass of 52.8 kDa (Figure 
2A, lane 1). This behavior is consistent with the highly acidic nature of many halophilic 
proteins (K. Ihara et al., Arch. Biochem. Biophys. , 286:1 1 1 (1991), which is hereby 
incorporated by reference). HemAT-Hs is purified from H. salinarum by metal-affinity 
and gel-filtration chromatography as a recombinant protein (HemATexHis-^) carrying a 

15 carboxyl-terminal six-histidine tag (Figure 2A, lane 2). A plasmid encoding carboxyl- 
terminal 6 His-tagged HemAT-/fe was constructed by two-step PCR. In the first step, 6 
His codons were fused to hemAT-Hs immediately in front of the natural stop codon. In the 
second step, &Xbal restriction site was introduced at the 3' end of the gene. The second 
PCR product was subcloned into the Ndel and Xbal sites of plasmid pKJ427. This 

20 plasmid was introduced into a AhemAT-Hs strain (A. Brooun et al., J. Bacteriol. , 

180:1642 (1998), which is hereby incorporated by reference). Cells grown at 39 °C to 
mid-logarithmic phase were harvested by centrifugation (4000 x g) at 4 °C. The pellet 
was resuspended in buffer (200 mM NaCl, 50 mM Tris-HCl, pH 8.0) and sonicated for 3 
min (12 pulses of 15 sec with 20 sec pauses). The cell lysate was centrifuged (100,000 x 

25 g) at 14 °C for 30 min, and the supernatant was used for purification. The POROS MC/M 
affinity column was washed with 1 M NaCl, 50 mM EDTA (pH 8.0), charged with 100 
mM CoCl 2 , and finally washed with 3M NaCl. The column was equilibrated with buffer 
(200 mM NaCl, 50 mM Tris-HCl, pH 8.0) prior to loading the sample. HemAT 6xH i S -/ft 
was eluted with a linear gradient of imidazole (0-250 mM). The peak fractions were 

30 collected, concentrated, and applied to a HiLoad Superdex 200 gel-filtration column. The 
peak fractions were concentrated with an Amicon 100K concentrator. HemAT-Z?s is 
purified using a combination of ammonium-sulfate precipitation/fractionation and gel- 



filtration chromatography. As expected, purified HemAT-Ss migrates during SDS-PAGE 
as a 48.7 kDa protein (Figure 2A, lane 3). 

The preferred bacterial heme binding proteins are myoglobin-like proteins. In 
particular, the heme binding proteins show greater than 20% identity to a vertebrate 
myoglobin protein, such as sperm whale myoglobin. More preferred are proteins which 
show greater than 30% or 50% identity. The level of identity is calculated using the 
protein alignment program of BLAST with the default parameters. 

In a preferred embodiment, the heme-binding protein is isolated from Archaea. 
The Archaea are a group of organisms often found in extreme environments, such as high 
temperatures, high salt concentrations, and acidic conditions. The conditions are often so 
extreme that other organisms are unable to survive in that environment. Proteins isolated 
from the Archaea often exhibit higher stability in the presence of high temperatures, high 
salt concentrations, or low pH. Generally, the proteins isolated from Archaea are 
preferred due to their higher stability. 

In particular, the protein is isolated from Halobacterium salinarum. H. salinarum 
is a salt tolerant organism. Similarly, the HemAT-i£y protein is salt tolerant. The 
sequence of the gene encoding the HemAT-iTs protein is shown in SEQ. ID. No. 1 as 
follows: 

ATGAGCAACG ATAATGACAC TCTCGTGACC GCCGACGTTC GGAACGGGAT CGACGGGCAC 6 0 
GCACTCGCGG ACCGGATCGG CCTCGACGAG GCGGAGATCG CGTGGCGGCT GTCGTTCACC 12 0 
GGGAT CGACG ACGACACGAT GGCCGCGCTC GCCGCCGAAC AGCCGCTGTT CGAAGCCACC 18 0 
GCGGACGCGC TGGTGACCGA CTTCTACGAC CACTTGGAGT CCTACGAGCG CACACAGGAC 24 0 
CTCTTCGCGA ACTCCACGAA GACCGTCGAG CAACTCAAAG AGACGCAGGC CGAGTACTTG 300 
CTGGGCCTCG GGCGCGGCGA GTACGACACC GAGTACGCCG CCCAGCGCGC CCGTATCGGG 36 0 
AAGATACACG ACGTGCTCGG GCTCGGACCG GACGTCTATC TGGGCGCGTA CACGCGATAC 42 0 
TACACGGGGC TGTTGGACGC GCTTGCCGAC GACGTGGTCG CCGACCGCGG CGAGGAGGCG 48 0 
GCCGCCGCCG TCGACGAACT CGTGGCCCGG TTCCTGCCGA TGTTGAAGCT GTTGACCTTC 54 0 
GATCAGCAGA TCGCAATGGA CACCTACATC GACTCGTACG CCCAGCGCCT CCACGACGAG 60 0 
ATCGACAGCC GCCAGGAGTT GGCGAACGCG GTCGCCACGC ACGTGGAAGC ACCGCTGTCC 66 0 
TCGCTGGAGG CGACCTCGCA GGACGTCGCC GAGCGCACGG ACAC GATGCG GGCCCGCACC 72 0 
GACGAC CAGG TCGACCGCAT GGCTGACGTC AGCCGTGAGA TATCCAGCGT GTCCGCGAGC 78 0 
GTCGAGGAGG TCGCCTCGAC GGC CGACGAC GTCCGCCGGA CCAGCGAGGA CGCCGAGGCG 84 0 
CTGGCCCAGC AGGGCGAGGC GGCCGCCGAC GACGCGCTCG CCACGATGAC CGACATCGAC 90 0 
GAGGCGACCG ACGGCGTCAC CGCGGGCGTC GAACAGCTCG GCGAGCGCGC CGCCGACGTC 96 0 
GAAT CAGTGA CCGGCGTGAT CGACGACATC GCCGAGCAGA CGAACATGCT GGCGCTGAAC 102 0 
GCGTCCATCG AGGCCGCCCG CGCCGGGGAG GCGGGCGAGG GGTTTGCGGT CGTCGCCGAC 108 0 
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GAGGT CAAGG CCCTCGCCGA GGAGTCCCGC 
GAGCAGATGC AGGCGGAGAC CGAGGAGACG 
ATCGGCGAGG GCGTCGAGCG CGTCGAGGAG 
GCCGTCGAGG ACGCCGCAAG CGGGATGCAG 
GTGAGCACCG AGGAGGTCGC CGAGATGGTC 
GCGGCCGCCC T CGATGAC AT CGCCGACGCG 
GTCCGCGAGA CGGTCGGCAA GCTCAGCTAG 



GAGCAGTCCA CGCGCGTCGA GGAGCTCGTC 114 0 
GTCGACCAGT TGGACGAGGT CAACCAGCGC 120 0 
GCGATGGAGA CCCTCCAGGA GATCACCGAC 12 6 0 
GAGGTGTCGA CGGCGACCGA CGAACAGGCG 132 0 
GACGGTGTCG ACGACCGCGC GGGCGAGATC 13 8 0 
ACCGATCAGC AGGTC CGGAC CGTCGAGGAG 144 0 
1470 



The hemAT-Hs gene encodes a protein which has an amino acid sequence as 
shown in SEQ. ID. No. 2 as follows: 

MSND3SIDTLVTADVRNGIDGHALADRIGLDEAE IAWRLS FTGIDDDTMAALAAEQPLFEAT 6 0 

ADALVTDFYDHLESYERTQDLFANSTKTVEQLKETQAEYLLGLGRGEYDTEYAAQRARIG 12 0 

KIHDVLGLGPDVYLGAYTRYYTGLLDALADDWADRGEEAAAAVDELVARFLPiyiLKLLTF 18 0 

DQQIAMDTYIDS YAQRLHDEIDSRQELANAVATHVEAPLSSLEATSQDVAERTDTMRART 240 

DDQVDRMADVSREISSVSASVEEVASTADDVRRTSEDAEALAQQGEAAADDALATMTDID 3 00 

E ATDGVTAGVEQLGERAADVE S VTGVI DD I AEQTNMLALNAS IEAARAGEAGEGFAWAD 36 0 

EVKALAEESREQSTRVEELVEQMQAETEETVDQLDEVNQRIGEGVERVEEAMETLQEITD 42 0 

AVEDAASGMQEVSTATDEQAVSTEEVAEMVDGVDDRAGEIAAALDDIADATDQQVRTVEE 48 0 

VRETVGKLS 48 9 

In another embodiment of the invention, the heme-binding protein is isolated from 
Bacillus subtilis. Preferably, the Bacillus subtilis gene is hemAT-Bs, which has a nucleic 
acid sequence according to SEQ. ID. No. 3, as follows: 

ATGTTATTTA AAAAAGACAG AAAACAAGAA ACAGCTTACT TTTCAGATTC AAACGGACAA 6 0 

CAAAAAAACC GCATTCAGCT CACAAACAAA CATGCAGATG TCAAAAAACA GCTCAAAATG 12 0 

GTCAGGTTGG GAGATGCTGA GCTTTATGTG TTAGAGCAGC TTCAGCCACT CATTCAAGAA 18 0 

AATATCGTAA ATATCGTCGA TGCGTTTTAT AAAAACCTTG ACCATGAAAG CTCATTGATG 24 0 

GATATCATTA ATGATCACAG CTCAGTTGAC CGCTTAAAAC AAACGTTAAA ACGGCATATT 30 0 

CAGGAAATGT TTGCAGGCGT TATCGATGAT GAATTTATTG AAAAGCGTAA CCGAATCGCC 36 0 

TCCATCCATT TAAGAATCGG CCTTTTGCCA AAATGGTATA TGGGTGCGTT TCAAGAGCTC 42 0 

CTTTTGTCAA TGATTGACAT TTATGAAGCG TCCATTACAA ATCAGCAAGA ACTGCTAAAA 48 0 

GCCATTAAAG CAACAACAAA AATCTTGAAC TTAGAACAGC AGCTTGTCCT TGAAGCGTTT 54 0 

CAAAGC GAGT ACAACCAGAC CCGTGATGAA CAAGAAGAAA AGAAAAACCT TCTTCATCAG 60 0 

AAAATT CAAG AAACCTCTGG ATCGATTGCC ATTCTGTTTT CAGAAACAAG CAGATCAGTT 66 0 



CAAGAGCTTG TGGACAAATC TGAAGGCATT TCTCAAGCAT CCAAAGCCGG CACTGTAACA 72 0 

TCCAGCACTG TTGAAGAAAA GTCGATCGGC GGAAAAAAAG AGCTAGAAGT CCAGCAAAAA 78 0 

CAGATGAACA AAATTGACAC AAGCCTTGTC CAAATCGAAA AAGAAATGGT CAAGCTGGAT 84 0 

GAAATCGCGC AGCAAATTGA AAAAATCTTC GGCATCGTCA CAGGCATAGC TGAACAAACA 90 0 

AACCTTCTGT CGCTCAATGC ATCTATTGAA TCGGCCCGCG CCGGAGAACA CGGGAAAGGC 96 0 

TTTGCTGTCG TGGCAAATGA AGTGCGGAAG CTTTCTGAGG ATACGAAAAA AACCGTCTCT 102 0 

ACTGTTTCTG AGCTTGTGAA CAATACGAAT AC ACAAAT C A ACATTGTATC CAAGCATATC 108 0 

AAAGACGTGA ATGAGCTAGT CAGCGAAAGT AAAGAAAAAA TGACGCAAAT TAACCGCTTA 114 0 

TTCGATGAAA TCGTCCACAG CATGAAAATC AGCAAAGAGC AATCAGGCAA AATCGACGTC 120 0 

GATCTGCAAG CCTTTCTTGG AGGGCTTCAG GAAGTCAGCC GCGCCGTTTC CCATGTGGCC 126 0 

GCTTCCGTTG ATTCGCTTGT CATCCTGACA GAAGAATAAC CATCAAAAAC CGGTCTGCCA 132 0 

TACGGCCGGT TTTTTTGCGT TCATTATGTA AACTTAAATT AAAAATCAGT TGACATAATA 138 0 

ATTACCTGCA 13 90 



In a preferred embodiment, the protein has an amino acid sequence of SEQ. 
ID. No. 4, as follows: 



ML FKKDRKQETAYF SDSNGQQKNR I QLTNKHADVKKQLKMVRLGDAELYVLEQLQPL I QE 6 0 

NIVNIVDAFYKNLDHESSLMDIINDHSSVDRLKQTLKRHIQEMFAGVIDDEFIEKRNRIA 120 

SIHLRIGLLPKWYMGAFQELLLSMIDIYEASITNQQELLKAIKATTKILNLEQQLVLEAF 180 

QSEYNQTRDEQEEKKNLLHQKIQETSGSIANLFSETSRSVQELVDKSEGISQASKAGTVT 240 

SSTVEEKSIGGKKELEVQQKQMNKIDTSLVQIEKEMVKLDEIAQQIEKIFGIVTGIAEQT 300 

NLLSLNAS IESARAGEHGKGFAWANEVRKLSEDTKKTVSTVSELVNNTNTQINIVSKHI 360 

KDVNELVSESKEKMTQINRLFDEIVHSMKISKEQSGKIDVDLQAFLGGLQEVSRAVSHVA 42 0 

ASVDSLVILTEE 432 



The invention also provides fragments of the isolated heme-binding protein which 
contain a functional heme-binding domain. The fragment containing the functional 
heme-binding domain may be coupled to a heterologous signal transduction domain. As 
described in the examples, a minimum heme binding domain has been determined for 
HemAT-Hs and partially determined for HemAT-ifo. Furthermore, comparisons 
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beteween various globin proteins has allowed for the identification of conserved regions 
of the proteins. 

HemAT-ffe in Halobacterium salinarum and HemAT-ifc in Bacillus subtilis, the 
first aerotactic transducers discovered that directly bind oxygen, are heme-based, and are 
5 homologous to native sperm whale myoglobin (SWMb), albeit more structural than 

sequential. These proteins belong to the globin family. Globins bind, transport, and store 
oxygen, and are known to exhibit a distinctive fold of seven cc-helices that encompass a 
heme prosthetic group. The seven helices are labeled A, B, C, E, F, G, and H. 
Sometimes, an additional short helix (helix D) is found between helices C and E, as in the 

10 case of SWMb, to make a total of eight. In a 1987 publication, Bashford et al., 
"Determinants of a Protein Fold: Unique Features of the Globin Amino Acid 
Sequences," J. Mol. Biol. , 196:199-216 (1997), which is hereby incorporated by 
reference, reported that the sequence homology of all the 226 globin sequences known at 
that time were "as high as 80% or more for closely related species, or as low as 16% for 

15 more distant ones." Of all these proteins, only two residues were absolutely conserved 
throughout. These two residues were the phenylalanine at the end of the C helix (CD1) 
and the proximal histidine (F8). HemAT-i7s and HemAT-Zfo both contain these two key 
residues and are 23% and 1 1% homologous to SWMb, respectively, and share 20% 
sequence similarity between themselves. 

20 The report of myoglobin-type aerotaxis proteins in microorganisms, and the recent 

discovery of HemAT-ift and HemAT-fe has prompted an effort to find one or more 
signature motifs in these possible microbial globins. These would identify conserved 
regions of the proteins. In addition, with these motifs in hand, contemporary computer 
algorithms like those contained in the BLAST programs 

25 (http://www.ncbi.nlm.nih.gov/BLAST/) could permit convenient and rapid searches for 
other possible globins using this signature motif. These motifs could be used for 
classifying these newly discovered microbial globins together and eventually with the 
whole globin family. 

Vianogradov et al., "Adventitious Variability? The Amino Acid Sequences of 

3 0 Nonvertebrate Globins," Comp. Biochem. Physiol , 1 06B : 1 -26 ( 1 993), which is hereby 
incorporated by reference, have noted the extensive variation of invertebrate globins over 
the vertebrates and Bashford et al., "Determinants of a Protein Fold: Unique Features of 
the Globin Amino Acid Sequences," J. Mol. Biol. , 196:199-216 (1987), which is hereby 
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incorporated by reference, have recognized that alignments of invertebrate globins with 
vertebrate globins based strictly on sequence similarity and vertebrate data sets are 
questionable. Invertebrate myoglobins were therefore not included in the preliminary 
data and the search for a globin motif was limited to vertebrates. Microbial globins, 
5 however, were later included and incorporated into the alignment by conserving 

secondary structure and avoiding gaps as in the work of Kapp et al., "Alignment of 700 
Globin Sequences: Extent of Amino Acid Substitution and Its Correlation With Variation 
in Volume," Pro. Sci. , 4:2179-2190 (1995), which is hereby incorporated by reference. 

An 80-aa consensus peptide sequence was constructed using the manual alignment 

10 of sperm whale myoglobin (SWMb), the oxygen sensor in Bacillus subtilis, HemAT-fe, 
and the oxygen sensor in Halobacterium salinarum, HemAT-ife. The intent was to find a 
minimal length of protein containing the myoglobin signature motif and see how many 
myoglobin proteins this sequence would recognize on the non-redundant (nr) database at 
NIH using the BLAST server (http://www.ncbi.nlm.nih.gov/BLAST/). An X was issued 

15 to residues of high variability (Bashford et al., "Determinants of a Protein Fold: Unique 
Features of the Globin Amino Acid Sequences," J. Mol. Biol. , 196:199-216 (1987), which 
is hereby incorporated by reference) while conserved residues retained their specific 
amino acid designation. Critical to the alignment was the positioning of the two residues 
known to be absolutely conserved in all known globins: Phe at the CD1 position and the 

20 proximal His at the F8 position (Bashford et al., "Determinants of a Protein Fold: Unique 
Features of the Globin Amino Acid Sequences," J. Mol. Biol , 196:199-216 (1987), which 
is hereby incorporated by reference). Using these residues as markers, the myoglobin- 
like protein (MbLP) sequence was generated and consisted of two domains separated by 
32 variable amino acids. The first myoglobin-type domain (Ml -box) contained the 

25 absolutely conserved phenylalanine residue; the second (M2-box) contained the 
absolutely conserved proximal histidine. A BLAST search was then performed, 
comparing the sequences of MbLP and SWMb with those of all other proteins in the non- 
redundant database. Search parameters were default except for the EXPECT parameter, 
which was increased to 1000 to allow for matches of lesser sequence homology. This 

30 comparison between the number and type of SWMb hits and MbLP hits was used to 
assess the quality of the MbLP sequence in extracting myoglobin proteins. 

A microbial globin-type sequence was generated from the results of a previous 
BLAST search on microbial globins and included Vitreoscilla hemoglobin for structural 
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markers. This sequence was used to extract 9 bacterial and 8 eukaryotic hemoglobins and 
flavohemoproteins. This sequence was generated to incorporate microbial globins into 
the search of a combined globin motif. Manipulation and alignment of the microbial 
globin-type peptide with MbLP and incorporating the same marker residues produced a 
5 second consensus sequence 96-amino acids in length called the triplet globin motif 
(TGM) because it consisted of three domains: two myoglobin-type domains (Ml -box, 
M2-box) and one hemoglobin-type domain (H-box). TGM was the final sequence used 
for further analysis and BLASTP searches with the TGM sequence were performed at a 
lower EXPECT parameter of 600 to reduce the amount of false-positives. 

1 0 The ability of the myoglobin motif to recognize myoglobins was tested using 

SWMb as a reference. A BLASTP search of the non-redundant protein database was 
performed using the 153-aa native sperm whale myoglobin (SWMb) as the query 
sequence. This sequence recognized 83 unique myoglobins and a wealth of hemoglobins. 
With some manipulation of the search conditions, however, SWMb was able to extract 

1 5 HemAT-Hs as well. 

A first attempt at a globin-type motif produced the 80-aa myoglobin-like protein 
(MbLP) sequence consisting of two domains, the Ml -box and M2-box, as found in Figure 
3. These two domains recognized 73 myoglobins, or 88% of those found by SWMb, 
along with HemAT-iis', KemAT-Bs, and a few non-globins. In contrast, however, MbLP 

20 didn't recognize any hemoglobins. 

An effort was made to enhance the globin-type motif of MbLP by building upon 
itself. This effort resulted in the 96-aa triplet globin motif (TGM) protein sequence and 
consisted of three domains: the Ml-, M2-box, and a new H-box situated in front of the 
two. The TGM sequence was compared to the MbLP and SWMb by subjecting it to the 

25 same BLASTP search analysis. TGM recognized 75 myoglobins (90% of SWMb hits), 
17 hemoglobin and hemoglobin-like proteins, and the two HemATs. The 17 hemoglobin 
and hemoglobin-like proteins consisted of 5 non-microbial eukaryotic hemoglobins from 
three different organisms and 12 microbial hemoglobins, three eukaryotic and nine 
bacterial. It is evident that the TGM sequence is more general than MbLP in recognizing 

30 globin motifs. 
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Table 2 Alignment and classification of some of the resultant proteins in the M2- 
box region using TEMPLATE as the template. Shaded residues are conserved in their 
respective boxes (H-, Ml-, M2-box). 



l Secondary hhhhhhhhhhhhllllllllHHHHHHHHHHHllllhhhhh 
E >| |< F >| |<G 

| < M2 Box > | 

TEMPLATE tmpseq_l 58 XXXXXXXXXXXXXXXXXXAQRXR-LAQlgAXKGKIgDWjL 96 

SEQ. ID. No. 3 0 * S 

MYG_PHYCA P02185 66 VTVLTALGAILKKKGHHEAELKP-LAQSjfATKHKIglKtL 104 
SEQ. ID. No. 31 

MYG_KOGSI P02184 66 VTVLTALGAILKKKGHHEAELRP-LAQS||ATKHKIgIKlL 104 

SEQ . ID. No. 32 "" * m 

MYG_ROUAE P02163 66 ATVLTALGGILKKKGQHEAQLKP-LAQS§ATKHKI|VJ^L 104 

SEQ. ID. No. 33 

MYG_TURTR P02172 6S NTVLTALGAILKKKGHHDAELKP-LAQS|jATKHKI|IKgL 104 

SEQ. ID. No. 34 

MYG_GLOME P02174 66 NTVLTALGAILKKKGHHEAELKP-LAQSSATKHKlgll^L 104 

SEQ. ID. No. 35 

MYG_WHAUK JT0636 66 VTVLTQLGKILKQKGNHESELKP-LAQT^ATKHKI^VKjL 104 

SEQ. ID. No. 36 

HemAT-Bs CAA74545 96 LKRHIQEMFAGVIDDEFIEKRNR- IASlBLRIGLLgKWjpyi 134 
SEQ. ID. No. 37 

MYG_MDSAN P14399 60 ADTVLSALGNIVKKKGSHSQPVKALAATHITTHKI|PH|F 99 

SEQ. ID. No. 38 

HemAT-Jfs 1654421 



The secondary structure reported in Figure 3 is that of SWMb and is considered 
typical of the globins. It was interesting to note that the domain responsible for HemAT 
recognition, the M2-box, lies in the region between the F and G helix, which contains the 
proximal histidine. Alignments indicate that two loop regions of the HemATs (CD and 
EF loops) are much more extensive than in SWMb. The M2-box does not include the 
HemATs' distinctive EF loop, thereby allowing recognition of both of the transducers. 
The Ml -box not only includes the B and C helix, but also specifies the entire CD loop 
region, which, inadvertently, ends up excluding the HemATs. 

The H-box recognizes primarily microbial hemo-globins/proteins, which is 
equivalent in position to the last two-thirds of helix A and the first third of helix B of 
SWMb. This region is highly significant, as it is could help place a sequence like 
TEMPLATE in an phylogenetic tree, thereby connecting the eukaryotic and eubacterial 
hemoglobins with the myoglobins and myoglobin-like proteins. The Ml -box, containing 
what would be helices B and C from SWMb, incorporates one of the absolutely 
conserved residues, Phe, from the CD region and only pulls out myoglobins from higher 
species. Though the match scores are much lower, the M2-box pulls out almost the same 
myoglobins as the Ml -box, however, recognition of HemAT -Hs and HemAT-Zfr occurs 
only in the M2-box. 
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Based upon the extensive information available regarding conserved structures in 
the proteins, as well as the minimal functional regions, one can predict modifications to 
the proteins which will not alter the function of the protein. 

The present invention also provides a blood substitute. An urgent need exists 
5 among the medical community for an alternative to whole blood or red blood cells for use 
in transfusion. However, the possibility of transmitting viral infections is ever present in 
derivatives of human blood. The rapid spread of the AIDS virus as well as the discovery 
of multiple forms of the virus amplifies this concern. Both Hem AT -Hs and/or HemAT- 
Bs may present an alternative to whole blood in transfusion situations. HemAT-Hs and 

10 HemAT-ite are particularly attractive in this regard, since they appear to have low oxygen 
affinity, a property required for artificial oxygen carriers. Expressed in microorganisms, 
these oxygen carriers will be free of infectious agents. In addition, the ability of HemAT- 
Hs and HemAT-5s to bind oxygen reversibly and to regulate this binding through the 
signal transduction domain may lead to new blood substitute products. Currently cross- 

1 5 linked hemoglobins are being developed as blood substitutes but these proteins suffer 
from poor regulation of oxygen binding. It is possible to develop HemAT-Hs and 
HemAT-i?s as a "blood-substitute" due to their size (~50 kDa, similar to hemoglobin) 
which prevents filtering by the kidney's, its ability to reversibly bind oxygen, and its 
ability to regulate this binding. Genetically engineered fragments of the hemAT-Hs and 

20 hemAT-Bs genes that encode of the transduction domain provides a wealth of 
opportunities to regulate oxygen binding. 

In a preferred embodiment, the blood substitute has a heme binding domain of the 
isolated heme-binding protein. The blood substitute may also have a heterologous signal 
transduction domain, to alter the affinity for oxygen or other gases. 

25 The blood substitute may be administered to a patient suffering from low blood 

levels. Such a blood substitute has numerous advantages because it could be used as a 
substitute when whole blood is not available. Furthermore, the blood substitute can be 
produced so that it is free of infectious substances, such as viruses and bacteria. 

In addition to using heterologous signaling domains, the oxygen binding of the 

30 heme-binding protein may be altered by modifying the signaling domain. 

The invention also provides a method for controlled storage of oxygen. The 
bacterial heme binding protein can be contacted with oxygen allowing the protein to bind 
and store oxygen. The protein may also be covalently attached to a solid substrate via the 
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transduction domain. Subsequent triggering of the transduction domain can result in 
oxygen release. 

The present invention may also be used to sense gaseous ligands by exposing the 
bacterial heme binding protein to a sample to be tested and measuring a change in the 
5 conformation of the protein. Enzyme sensors are well known as biological sensors. They 
are utilized mainly for clinical chemical analysis, including use for glucose in blood, urea 
and neutral and phospholipids. The ability of HemAT-Hy and HemAT-Ss to sense 
oxygen as well as other small gaseous ligands provides opportunities to develop novel 
biosensors for O2, NO, CO, and even CN". 
10 The changes in the conformation of the protein may be monitored in various ways 

including monitoring the protein optically or electronically. 

The preferred gaseous ligands to monitor with the heme binding bacterial protein 
are O2, NO, CO, and CN". The preferred gaseous ligand is O2. 

As discussed above fragments of the bacterial heme binding protein may also be 
15 used as long as they contain a functional heme binding domain. 

The present invention also provides a chimeric protein having a heme-binding 
domain of an isolated heme binding bacterial protein and a heterologous signaling 
domain. Varying the signaling domain can alter the oxygen or ligand binding 
characteristics of the protein. The signaling domain may also be altered to make the 
20 protein responsive to other signals. 

In another embodiment, the invention provides an isolated nucleic acid molecule 
which encodes a bacterial heme binding protein with a heterologous or mutated signaling 
domain. 

The bacterial heme-binding proteins may also be used for heme-based catalysis. 

25 It is well known that Fe(III)porphyrins can catalyze a wide variety of chemical reactions 
including hydrogen peroxide degradation, mono oxygenation, and lignin degradation. 
HemAT-iifc can also be prepared in the Fe(III) form providing an opportunity to utilize 
this protein as a novel heme-based catalyst. In addition, the ability to regulate the heme 
domain by the transduction domain may allow for catalytic specificity to be achieved via 

30 genetic manipulation of this domain. 

The proteins of the present invention may also be used for artificial 
photosynthesis. HemAT-Hs can be reconstituted with different porphyrins including 
photoactive Zn and Sn derivatives. These derivatives may posses the ability to absorb 
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light energy and transmit information concerning the excited state of the photoactive 
poprhyrin to the sensing domain providing-the equivalent to photosynthesis, i.e., 
conversion of light energy to chemical potential energy. 

The bacterial heme binding proteins may also be used in in vivo and in vitro 
5 testing system for identifying potential signaling functions of mutated a-hemoglobin and 
myoglobin causing several diseases. Mutated human oc-hemoglobin and myoglobin genes 
can be fused with fragment of hemAT-Hs or hemAT-Bs genes that encodes signaling 
domain via linker region. The physiological function of the expressed chimeric protein of 
human a-hemoglobin (or myoglobin) and HemAT-i7s or HernAT-ifa can be tested by 
1 0 capillary aerotaxis assay. As transducer proteins HemAT-Hs and HemAT-fe may cause 
phosphorylation of CheA. Once this feature of HemAT-Hs and HemAT-ifc are tested and 
optimized, similar in vivo strategy of chimeric protein construction can be tested for in 
vitro phosphorylation assay. 

15 EXAMPLES 

Example 1 - Mutagenesis of HemAT-ifs and HemAT-J?s 

The HtrVIII is a positive aerotaxis transducer in H. salinarum (Brooun et al., I 
20 Bacteriol . 1 80: 1 642- 1 646 (1 998), which is hereby incorporated by reference). A strain 
deleted for the htrVIII gene lacks positive aerotaxis while a strain overproducing the 
protein shows an enhanced aerotactic response. To investigate the possible role of 
HemAT-/fo and HemAT-i?s in aerotaxis, deletion mutants of these genes were 
constructed (Brooun, Ph.D thesis. University of Hawaii, Hawaii (1997), which is hereby 
25 incorporated by reference) for the construction of hemA T-Hs deletion strains. 

Construction of overexpression of hemAT-Hs in H. salinarum: Ndel and Xbal restriction 
sites were used to clone the hemAT-Hs gene into the E. coli-H. salinarum shuttle vector 
pKJ427. Top primer with Ndel cutting site (5 ' CCGAATTCCATATGAGCAACGAT 
AATGAC 3' (SEQ. ID. No. 40)) and bottom primer with Xbal cutting site (5'CCTCTA 
30 GAGGATEECTAGCTGAGCTTGCCGACC 3 ' (SEQ. ID. No. 4 1 )) were synthesized 
and used for PCR amplification of hemAT-Hs gene. The PCR amplicon was cloned into 
TOPO cloning vector (Invitrogen) and transformed into E. coli competent cells. The 
plasmid containing hemAT-Hs gene in TOPO vector was subcloned into pKJ427 vector 
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by NdeVXbal double digestion. The hemAT-Hs/p¥J427 construction was confirmed by 
PCR as well as NdeVXbal double digestion and transformed into AhtrVIII strain using 
standard halobacteria transformation protocol. Individual colonies were checked by PCR 
and immunoblot to confirm the expression level of HemAT-Hs; Construction of OI3428: 
5 A 322 bp fragment interior to HemAT-fe was amplified from the B. subtilis wild type 
strain OI1085 chromosome using primers with overhanging Hindlll and BamHl sites 
(reverse primer: 5' TATGGGATCCCTTGTTCATCACGGGTCTETTGG 3' (SEQ. 
ID. No. 42), forward primer: 5' GATAAAGCTTGATCATAGCTCAGTTGACCG 3' 
(SEQ. ID. No. 43)). This PCR fragment was digested with Hindlll and BamHl and 

10 cloned in the integration vector pHV501 (Vagner et al., Microbiology . 144(Pt 1 1):3097- 
3104 (1998)) to create pMKl. The resultant plasmid pMKl was transformed into Oil 085 
and HemAT-ifa mutants were selected by erythromycin resistance. Integration of the 
pMKl into the correct locus was checked by linkage analysis. The hemAT-Bs locus is 
30% linked to the glyk locus as determined from the B. subtilis chromosomal map. 

1 5 GL Y+ transductants were selected and scored for erythromycin resistance. Construction 
of OI3498: The entire HemAT-Bs gene including the native promoter and the ribosome 
binding site was amplified from the B. subtilis wild type strain Oil 085 chromosome using 
primers with overhanging EcoRl and BamHl sites (HemAT-fe amy up: 5' 
TGCTGAATTCGCAGCTTTCATTCATGTTTCCC 3'(SEQ. ID. No. 44), HemAT-^ 

20 amydown: 5' TTAGGGATCCGTCAACTGATTTTTAA TTTAAGTTAC 3') (SEQ. 
ID. No. 45)). The PCR amplicon was digested with EcoRl/BamHl and cloned into the 
amyE integration vector pDG1730 (Guerout-Fleury et al, Gene , 180(l-2):57-61 (1996), 
which is hereby incorporated by reference) to produce pKZ2. The resultant plasmid 
pKZ2 was digested with BgR/Xbal to ensure a double crossover event into the amyE 

25 locus and then transformed into OI3428 to select for Spec-R. HemAT-5^ overexpression 
R4: Overexpression construction in E. coli: The HemAT-ifo overexpression construction 
was performed as follows: B. subtilis OI 1085 genomic DNA was used for the PCR 
amplification of HemAT-ifr gene by Pfu DNA polymerase using two primers (Top 
primer with BamHl restriction site: 5'ATATGGATCC 

30 AAGGGGGATC ATTGTAATGTTATTTAAAAAAG3 ' (SEQ. ID. No. 46), Bottom 
primer with Pstl site: 5' ATTACTGCAGCAACTGATTTTTAATTTAAGTTT 
ACATAATGAACGC 3' (SEQ. ID. No. 47)). The PCR amplicon was cloned into TOPO 
cloning vector (Invitrogen) and transformed into TOP 10 E. coli competent cells. 
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Colonies were tested for the presence of plasmids containing the correct insert. The 
recombinant plasmid was digested with BamHl and Pstl and the insert with HemAT-fe 
open reading frame was cloned into the pMALcII expression vector (New England 
Biolabs, Inc). 

5 

Example 2 — Aerophilic and Aerophobic Responses 

The resultant construct was transformed to E. coli pLysS cells for the expression 
and analyzed their behavior in a flat microcapillary using dark-field microscope coupled 

10 with time-lapse digital video system. Motile wild-type halobacterial cells form two clear 
congregated aerotactic bands, a positive one close to the interface between air and cell 
suspensions and a negative one away from the interface (Figure 5A, wild type). The 
positive aerotactic band is mediated by HtrVIII (Brooun et al., J. Bacteriol. , 180:1642- 
1646 (1998), which is hereby incorporated by reference). As expected, this phenomenon 

15 is absent in the htrVIII deletion strain (Figure 5 A, HemAT-/fe+AHtrVIII). However, like 
the wild type strain, the AHtrVIII strain also demonstrates the negative aerotactic band. If 
negative aerotaxis behavior is related to HemAT-ift, one would postulate that in the 
hemAT-Hs deletion strain, the negative aerotactic band would not form. Indeed, in the 
AhemAT-Hs strain, in which the positive aerotactic band is present due to the receptor 

20 HtrVIII, the sharp boundary of the negative aerotactic band is absent (Figure 5A, 

AhemAT-Hs ). Furthermore, when HemAT-Hs is overexpressed (using a multicopy 
plasmid) in a AhtrVIII strain, halobacterial cells form a more pronounced negative 
aerotaxis boundary (Figure 5A, HemAT-ifr+ AHtrVIII). These cells were repelled from 
the air/liquid interface much faster and created a denser aerotactic band than the 

25 aerotactically wild type or AHtrVIII strains containing genomic copy of hemAT-Hs 

(Figure 5A, wild type and HemAT-i£y+AHtrVTII). The aerophilic response in B. subtilis 
proceeds more rapidly than it does in H. salinarum (30 versus 180 min) because B. 
subtilis swims faster than H. salinarum. In the wild-type, an aerotactic band formed at 
the air interface (Fix. 5B). This band did not form in a strain from which all ten putative 

30 MCP-like transducers (Aten) were deleted (Fig. 5B). A strain lacking only HemAT-ifo 
showed an aerophobic response, indicating the presence of a second, unidentified 
aerotaxis receptor. To demonstrate the physiological function of HemAT-ifa 
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unequivocally, hemAT-Bs was overexpressed in a strain from which all B. subtilis 
transducer genes were deleted (Aten strain). When HemAT-^ was overexpressed in the 
Aten strain, the aerophilic response was observed (Fig. 5b). These assays demonstrate 
that HemAT-Bs is involved in an aerophilic response in B. subtilis. 

5 

Example 3 - Expression of HemAT-IZs and HemAT-Bs in Escherichia coli 

The FAD-binding aerotaxis transducer Aer in E. coli has a PAS domain that is 
similar to the redox-sensing domain of the NifL protein of Azotobacter vinelandii (Hill et 

10 al., Proc. NatL Acad Sci. USA , 93:2143-2148 (1996); Zhulin et al., Mol. Microbiol. , 
29:1522-1523 (1998), which are hereby incorporated by reference) and FixL from R. 
meliloti (Gilles-Gonzalez et al, Nature . 350:170-172 (1991), which is hereby 
incorporated by reference). FixL is a chimeric membrane protein with a histidine kinase 
domain, which belongs to the large class of two-component regulatory systems, whereas 

15 the heme-binding sensory domain belongs to the PAS domain super family (Gilles- 
Gonzalez et al., Nature . 350:170-172 (1991); Lois et al., J. Bacteriol. , 175:1103-1 109 
(1993); Gong et al., Proc. Natl. Acad. Sci. USA . 95:15177-15182 (1998), which are 
hereby incorporated by reference). None of the PAS domains identified in the genome of 
B. subtilis is present in chemotaxis transducers (Zhulin et al., Mol. Microbiol. . 29:1522- 

20 23 (1998), which is hereby incorporated by reference). To identify the nature of the 

prosthetic groups in HemAT-/fc and HemAT-ik, both proteins were expressed in E. coli 
by constructing vectors, which express the hemAT-Hs or hemAT-Bs gene under the 
control of an inducible T7 promoter (Studier et al., Methods in Enzymology , 185:60-89 
(1990), which is hereby incorporated by reference). 

25 Using a combination of anion exchange and gel-filtration chromatography, 

HemAT-Zfi- was purified (The BL21 pLysS host cells harboring hemAT-Hs or hemAT-Bs 
genes were grown to OD 6 oo = 0.4 in 1L of LB with appropriate antibiotics and induced 
with 0.6 mM IPTG. The cells were harvested by low speed centrifugation (4000xg) for 
15 min. at 4°C after a two-hour induction. The pellets were resuspended in 50 ml buffer 

30 (50 mM NaCl, 50 mM Tris-HCl, pH6.0) and sonicated for a total of 4 minutes (20 second 
pulses with 30 second pauses). The sonicated solution was centrifuged at 28,000xg for 20 
min. The brown red supernatant with HemAT-Hs or HemAT-Zfc was used for 
purification. HemAT-Hs: The supernatant was filtered through 0.2 micron filter and 
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applied to BioCAD anion exchange POROS HQ/M (16/100) perfusion chromatography 
column equilibrated with 50 mM Tris-HCI, pH6.0. A linear gradient of NaCl (0-1500 
mM) was applied and HemAT-ife was eluted at about 400 mM. For further purification, 
the fractions containing the HemAT-ife (monitored by Soret band absorbence at 410 nm 
5 and SDS-gel electrophoresis) were concentrated and applied to a Hiload Superdex 200 
1 6/60 gel filtration column. Peak fractions were concentrated with an Amicon 100K 
concentrator and used for spectroscopy. HemAT-Us: A saturated (NH^SC^ solution 
was added to the brown red supernatant to 30% and centrifuged at 28,000xg for 20 min. 
The optically clear light brown supernatant was further fractionated by (NH^SC^ 

10 addition to 36% saturation followed by centrifugation. The resultant pellet was 

solubilized in a resuspension buffer (500mM NaCl, 50mM Tris-HCI, pH8) and applied to 
a Hiload 26/60 Superdex 75 gel filtration column. Peak fractions containing HemAT-Bs 
(monitored by Soret band absorbence at 410 nm and SDS-gel electrophoresis) were 
concentrated by an Amicon 5 OK concentrator and used for spectroscopy). Recombinant 

1 5 HemAT-Hs- expressed in E. coli under low ionic strength conditions was shown to contain 
a high degree of secondary structure consistent with a predicted folded protein (Larsen et 
al., J. Prot. Chem. . 18(3) (1999), which is hereby incorporated by reference). 

The purified HemAT-iZy migrates at a position higher than the calculated 52.8 
kDa for the mature protein (Figure 2B line HemAT-TZs). This slow electrophoretic 

20 migration in SDS-polyacrylamide gels is consistent with the highly acidic nature of 

HemAT-ffe (pl=3.78, 27% acidic residues) and has been observed in other acidic proteins 
from halophiles (Ihara et al., Arch. Biochem. Biophys. , 286:1 1 1-1 16 (1991), which is 
hereby incorporated by reference). Using a combination of ammonium sulfate 
precipitation/fractionation and gel filtration chromatography it is possible to purify 

25 HemAT-55. The BL21 pLysS host cells harboring hemAT-Hs or hemAT-Bs genes were 
grown to OD600 = 0.4 in 1L of LB with appropriate antibiotics and induced with 0.6 mM 
IPTG. The cells were harvested by low speed centrifugation (4000xg) for 1 5 min. at 4°C 
after a two-hour induction. The pellets were resuspended in 50 ml buffer (50 mM NaCl, 
50 mM Tris-HCI, pH6.0) and sonicated for a total of 4 minutes (20 second pulses with 30 

30 second pauses). The sonicated solution was centrifuged at 28,000xg for 20 min. The 
brown red supernatant with HemAT-Hs or HemAT-Ry was used for purification. 
HemAT-Hs: The supernatant was filtered through 0.2 micron filter and applied to 
BioCAD anion exchange POROS HQ/M (16/100) perfusion chromatography column 
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equilibrated with 50 mM Tris-HCI, pH6.0. A linear gradient of NaCl (0-1500 mM) was 
applied and HemAT-Hs was eluted at about 400 mM. For further purification, the 
fractions containing the UemAT-Hs (monitored by Soret band absorbence at 410 nm and 
SDS-gel electrophoresis) were concentrated and applied to a Hiload Superdex 200 1 6/60 
5 gel filtration column. Peak fractions were concentrated with an Amicon 100K 

concentrator and used for spectroscopy. HemAT-2?s: A saturated (NH4) 2 S0 4 solution 
was added to the brown red supernatant to 30% and centrifuged at 28,000xg for 20 min. 
The optically clear light brown supernatant was further fractionated by (NH4) 2 S04 
addition to 36% saturation followed by centrifugation. The resultant pellet was 

10 solubilized in a resuspension buffer (500mM NaCl, 50mM Tris-HCI, pH8) and applied to 
a Hiload 26/60 Superdex 75 gel filtration column. Peak fractions containing HemAT-fe 
(monitored by Soret band absorbence at 410 nm and SDS-gel electrophoresis) were 
concentrated by an Amicon 50K concentrator and used for spectroscopy). The purified 
HemAT-Bs migrates in SDS-PAGE as 48.7 kDa protein as expected (Figure 2, line 

15 HemAT-Ss). 

Example 4 — Absorption Spectra of Purified UemAT-Hs and HemAT-Bs 

UemAT-Hs and HemAT-Bs display similar absorption spectra in both the near 
20 UV and visible regions characteristic of oxygen bound heme proteins. Specifically, 

absorption band maxima are found at 406 nm (Soret), 578 nm (a-band), and 538 nm (P- 
band) for both proteins (Figure 4A). These absorption maxima resemble those of Sperm 
whale oxymyoglobin (418 nm, 581 nm, and 543 nm) and oxy FixL (415 nm, 577 nm, and 
543 nm). Upon deoxygenation (using sodium dithionite), the Soret bands shift to 425 nm 
25 while the a- and P-bands converge to a broad band centered at 555 nm, consistent with 
the formation of a deoxy-form of the protein (i.e., absorption bands for deoxymyoglobin: 
434 nm and 556 nm and deoxyFixL: 433 nm and 567 nm) (Figure 4B). If the deoxy form 
of HemAT-Hs and HemAT-lfa are exposed to atmospheric oxygen, the absorption spectra 
revert back to that observed for the purified proteins (Figure 4D). Both the purified (oxy 
30 form) and the deoxy derivatives of HemAT-Hs and HemAT-Zfo are reactive towards 
carbon monoxide. The CO bound derivatives display absorption maxima at 415 nm 
(Soret), 573 nm (a-band), and 535 nm (P-band) (Figure 4C). A pyridine hemochrome 
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assay showed the heme group of both HemAT-iTs and HemAT-ifo to be b-type. HemAT- 
Hs and HemAT-Zte are distinct both in spectral features and in physiological function 
from the previously discovered heme protein FixL from R. meliloti (Gilles-Gonzalez et 
al., Nature , 350:170-172 (1991), which is hereby incorporated by reference). The 
5 absorption bands of both HemAT-i^s and HemAT-ifo are blue shifted, relative to FixL, 
indicating distinct heme pocket geometries. Unlike FixL, HemAT-ift and HemAT-#s 
display no PAS domain sequence homology. In addition, both HemAT-/fc and HemAT- 
Bs participate in negative aerotaxis while FixL acts as an oxygen sensing kinase. 

1 0 Example 5 - Methylation of HemAT-Fs and HemAT-#s 

It has been postulated that in E. coli, adaptation in Aer-mediated aerotaxis is 
methylation-independent (Taylor et al., Annu. Rev. Microbiol. , 53:90-103 (1999), which 
is hereby incorporated by reference). In contrast to E. coli, adaptation during aerotaxis in 

15 H. salinarum and B. subtilis is a methylation-dependent process (Brooun et al., J. 

Bacteriol. . 180:1642-1646 (1998); Lindbeck et al.. Microbiology . 141:2945-2953 (1995); 
Wong et al., J. Bacteriol. . 177:3985-3991 (1995), which are hereby incorporated by 
reference). To determine if HemATs can be methylated by the CheR methyltransferase, 
H. salinarum and B. subtilis cells were radiolabeled with [methyl-^ H] methionine after 

20 blocking protein synthesis. The radiolabeled cells were processed for fluorography and 
immunoblotting with a polyclonal antibody raised against the highly conserved region 
of methyl-accepting transducers (W. Zhang et al., Proc. Natl. Acad. Sci. U.S.A. 
93:4649 (1996), which is hereby incorporated by reference). A single radiolabeled band 
is missing in the AhemAT-Hs strain (Figure 2B, lane 1), whereas this band is present in 

25 the overexpression strain (Figure 2B, lane 2). This band is also recognized by the 
antibody, suggesting that HemAT-Hs is indeed a methyl-accepting transducer 
(Figure 2B, lanes 1' and 2'). In contrast, it was not possible to detect any 
[methyl-^ H]-labeling in HemAT-5s. Together with the capillary assays, these data 
demonstrate an important difference in the signaling and adaptation mechanisms 

30 for aerotaxis mediated by HemAT-i/s and HemAT-i?.?. 
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Example 6 - Materials and Methods for Example 7 

PCR and TOPO cloning. C-terminal primers were designed to amplify the 
5 250,230,210, 205, 200, 195, 194, 193, 192, 191, 190, 170 and 151 residues of HemAT-ifr 
and were designed to include BamHl and Xbal restriction sites. The N-terminal primer 
included EcoRl and Ndel restriction sites. Primer sequences may be found in Table 1. 
HemAT-i/s genomic plasmid was used as a template for PCR with Pfu polymerase. PCR 
amplification was performed in a GeneAmp PCR system 2400 (Perkin-Elmer) under the 

10 following conditions: Hot start with Pfu polymerase at 80°C followed by heat 

denaturation at 94°C for 2 minutes was followed by 25 cycles of heat denaturation at 94°C 
for 30 seconds, primer annealing at 58°C for 30 seconds and elongation at 72°C for 40 
seconds. Following the last cycle, samples were maintained at 72°C for 7 minutes and 
immediately kept at 4°C. Following PCR, the PCR product was immediately cloned into 

15 the TOPO vector (Invitrogen) and transformed into TOP 10 competent cells. Clones with 
the insertion were selected via kanamycin resistance on Luria Bertani (LB) agar plates 
with kanamycin (50jj,g/ml). Colonies were inoculated into CircleGrow (BIO 101) with 
kanamycin media and, following incubation, plasmids were isolated via alkaline mini 
prep. Plasmids were then restricted with EcoRl to screen for the proper insert. 

20 Cloning into pMAL expression vector. Plasmids containing the correct insert 

and the expression vector, pMAL-c2, were then digested with EcoRl and BamHl. pMAL- 
c2 was subsequently dephosphorylated with Alkaline Phosphatase. Digested TOPO 
plasmids and pMAL plasmid were run on a 1% preparative agarose gel. The truncated 
hemAT-Hs PCR insert and double digested pMAL-c2 bands were cut from the gel and the 

25 DNA was extracted from the gel using the GENECLEAN Spin Kit (BIOl 01). The 

hemAT-Hs insert was then ligated to the pMAL-c2 vector at 14°C, overnight. Following 
ligation, the ligation mixture was transformed into JM109 competent cells. Clones 
containing the plasmid were selected for by ampicillin resistance on LB agar Amp 
(100(xg/ml) plates. Ampicillin resistant colonies were inoculated into CircleGrow + Amp 

30 media and incubated. Plasmids were isolated via alkaline mini prep and the hemAT-Hs 
insert was screened for by double digest with EcoRl and BamHl. 

Transformation into expression host and protein expression. Plasmids 
containing the insertion were then transformed into BL21 pLysS competent cells 



-27- 



(Novagen). Clones containing both the pMAL-hemAT-Hs insertion plasmid and the 
pLysS plasmid were screened for by ampicillin (100u.g/ml) and chloramphenicol 
(34jig/ml) resistance on LB agar plates. To check for expression of the truncated MBP- 
HemAT-Hs fusion protein, cells were inoculated into LB Amp and Chi broth and grown 
5 to an OD6oo=0.4 followed by induction with 1 mg/ml IPTG. After induction for 1 .5 
hours, protein samples of uninduced and induced cultures were prepared and run on a 
10% SDS-PAGE. This was then followed by staining for protein with Coomasie Blue 
and destaining with 10% acetic acid. 

Protein purification by affinity chromatography and spectral analysis. 

1 0 Cultures which showed induction of the MBP-HemAT-Zfe protein were then grown up in 
a larger scale to OD<5oo = 0.4 and induced with IPTG (1 mg/ml). Induced cultures were 
then centrifuged at 5,000 rpm for 20 minutes at 4°C followed by a wash with column 
buffer (20mM Tris-HCI, 200mM NaCl, ImM EDTA) and centrifuged again at 5,000 rpm 
for 20 minutes at 4°C. If purification did not immediately follow the wash, protein pellets 

1 5 were stored at -70°C. Protein pellets were then resuspended in column buffer, sonicated 
for 2 minutes (20 second pulses at 45 second intervals) resuspended in column buffer, 
sonicated for 2 minutes (20 second pulses at 45 second intervals) and centrifuged at 
15,000 rpm for 20 minutes at 4°C. The protein containing supernatant was decanted, 
diluted 1 :2 and stored on ice. After setting up the amylose resin column (New England 

20 BioLabs), it was washed with 8 column volumes of cold column buffer. The sample was 
then loaded onto the column at a flow rate of 1 ml/min. followed by a 12 column volume 
wash with cold column buffer. MBP-HemAT-Tis protein was eluted with 10 mM maltose 
column buffer and collected in 1 ml fractions. Samples containing the most protein were 
used to determine the spectra via spectrophotometer. Following elution, a 10% SDS- 

25 PAGE was also often run to determine the amount of protein in elutions. Eluted samples 
were stored at 4°C. 
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Table 3. Names and sequences (5' to 3') of primers used in HemAT-Hs truncation. 



Primer Name 


Sequence (5' to 3') 






hemAT-Hs EcoRVNdel top 


ccgaattccatatgagcaacgataatgac 


SEQ. ID. No. 48 


hemAT-Hs 151 Bamffl/Xbal bot 


ctctagaggatccctagtcgtcggcaagcgcgtcc 


SEQ. ID. No. 49 


hemAT-Hs 250 B/X bot 


cctctagaggatccetagacgtcagccatgcggtc 


SEQ. ID. No. 50 


hemAT-Hs 230 B/X bot 


cctctagaggatccctaggcgacgtcctgcgaggtcgcc 


SEQ. ID. No. 51 


hemAT-Hs 210 B/X bot 


cctctagaggatccctacgcgttcgccaactcctggcggc 


SEQ. ID. No. 52 


hemAT-Hs 190 B/X bot 


cctctagaggatccctagatgtaggtgtccattgcgatc 


SEQ. ID. No. 53 


hemAT-Hs 170 B/X bot 


cctctagaggatccctaccgggccacgagttcgtcgac 


SEQ. ID. No. 54 


hemAT-Hs 205 B/X bot 


cctctagaggatccctactggcggctgtcgatctcgtc 


SEQ. ID. No. 55 


hemAT-Hs 200 B/X bot 


cctctagaggatccctactcgtcgtggaggcgctgggc 


SEQ. ID. No. 56 


hemAT-Hs 195 B/X bot 


cctctagaggatccctactgggcgtacgagtcgatgtag 


SEQ. ID. No. 57 


hemAT-Hs 194 B/X bot 


cctctagaggatccctaggcgtacgagtcgatgtaggtgtcc 


SEQ. ID. No. 58 


fewJr-ifcl93B/Xbot 


cctctagaggatccctagtacgagtcgatgtaggtgtcc 


SEQ. ID. No. 59 


hemAT-Hs 192 B/X bot 


cctctagaggatccctacgagtcgatgtaggtgtccattgcg 


SEQ. ID. No. 60 


hemAT-Hs 191 B/X bot 


cctctagaggatccctagtcgatgtaggtgtccattgcg 


SEQ. ID. No. 61 



Example 7 - Truncated HemAT-Hs 

5 

The finding that HemAT-Hs, an archael signal transducer, is a heme binding 
protein provides a unique opportunity to study not only the physiological function of this 
protein, but also obtain greater understanding of the structure of this soluble protein and 
how heme interacts with it. Therefore, this project aims to identify the minimum size of 

10 HemAT-Hs to which heme binds. This will be done by truncating the gene, first from the 
C - terminal, by PCR. Once the minimum size of the functional heme binding domain is 
found from the C - terminal, the N - terminal will then be truncated to further identify 
residues crucial in proper heme binding. Producing this truncated HemAT-Hs protein 
which still retains the functional heme binding domain will aid in efforts to determine 

15 HemAT-Hs protein structure. 

Analysis of heme binding in HemAT-Hs began with the first 151 residues of 
HemAT-Hs. However, preliminary spectral analysis showed no heme binding. Primers 
were then designed to amplify HemAT-Hs every 20 amino acids from 150 thereby 
amplifying the first 170, 190, 210, 230, and 250 amino acids of the N-terminal. HemAT- 

20 Hs 210 showed the spectra of heme bound to HemAT-Hs and also exhibited the 

characteristic red spectra of O2 bound heme in purified protein samples. HemAT-H? 1 90, 
however, did not present color in protein samples, nor did it have the visible bands at 540 
nm and 580 nm which represent bound heme. Primers were then designed every 5 amino 
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acids from 210 to 190 at 205, 200 and 195 to determine more precisely where heme 
binds. Only 200 and 195 construction fused with MBP showed a reddish color in protein 
samples along with the characteristic spectra. 

This narrowed the search down for the heme binding site to between 195 and 190 
5 amino acids of HemAT-Hs; thus, primers were designed at 191, 192, 193 and 194 amino 
acids. The spectra for the 192 and 191 constructs shows altered visible bands at 540nm 
and 580nm. The 194 construction shows a similar spectra, like wild-type HemAT--ffi> or 
the HemAT -Hs 195 construct. 

10 

Example 8 — Purification of Recombinant HemAT-Hs by Metal Chelate 
Chromatography 

1 L of E. coli culture containing HemAT-ifc was collected, washed with Buffer #2 
15 ( 200 MM NaCl, 50 mM Sodium phosphate, pH 8.0), and resuspended in 40 ml of Buffer 
#2. Cells are sonicated. Insoluble material is removed by ultracentrifugation at 100,000 
rpm for 20 minutes. POROS MC/M (1 00 X 1 .6 I.D., 20 um) is used for metal chelate 
chromatography. The column is washed with 50 mM EDTA, I M NaCl, pH 8.0 over 10 
column volumes followed by a wash with water. 100 mM CoCl 2 is used to charge the 
20 column, followed with a wash with 1 M NaCl and water. The column is equilibrated in 
buffer containing 5 mM imidazole. 5 ml of sample is loaded directly onto the column at a 
flow rate of 2-4 ml/min and a gradient of imidazole from 0-500 mM is run over 30 
column volumes at 1 0 ml/min. Fractions containing recombinant HemAT-/fc is pooled 
and concentrated using Centricon 50. 
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Example 9 - Site Directed Mutagenesis of HemAT-Hs 

In order to perform PCR-based site-directed mutagenesis, a plasmid containing 
5 the hemAT-Hs gene to be mutated with proper size has to be constructed first. The proper 
restriction recognition sites are created by designing the primers with the recognition site 
tags in the primers as follows: 

hemAT-Hs EcoRl/Ndel top primer: 

5' CCGAATTCCATATGAGCAACGATAATGAC 3' (SEQ. ID. No. 62) 
1 0 hemAT-Hs BamHl/Xbal bottom primer: 

5' CCTCTAGACTAGCTGAGCTTGCCGACC 3' (SEQ. ID. No. 63) 
Two sites in each primer were created to meet the needs of expressing HemAT-Hs 
in different expression vectors. hemAT-Hs genomic DNA in pDelta vector was used as a 
template for amplifying hemAT-Hs gene by PCR using proofreading DNA polymerase 
1 5 pfu. PCR product was cloned into TOPO vector (Invitrogen TOPO cloning Kit). The 
insert was checked and confirmed by digestion and PCR. This construction was used as 
template for generating serial histidine mutants. 

The plasmid construction from above was used for mutagenesis PCR. His 20, His 
71, His 123, His 198, and His 214 were mutated to alanine by PCR-based site-directed 
20 mutagenesis (described above). Mutated hemAT-Hs gene in Topo vector has been 
checked by manual sequence as well as Auto Sequencer 373. 



Table 4: Primers for mutagenesis. 



Primer Name 


Sequence 






H20A 


GGAACGGGATCGACGGGgccGCACTCGCGGACCGG 


SEQ. ID. No. 64 


H20A-R 


CCGGTCCGCGAGTGCggcCCCGTCGATCCCGTTCC 


SEQ. ID. No. 65 


H70A 


GACCGACTTCTACGACgccTTGGAGTCCTACGAGCG 


SEQ. ID. No. 66 


H70A-R 


CGCTCGTAGGACTCCAAggcGTCGTAGAAGTCGGTC 


SEQ. ID. No. 67 


H123A 


CCGTATCGGGAAGATAgccGACGTGCTCGGGCTCG 


SEQ. ID. No. 68 


H123A-R 


CGAGCCCGAGCACGTCggcTATCTTCCCGATACGG 


SEQ. ID. No. 69 


H198A 


CGTACGCCCAGCGCCTCgccGACGAGATCGACAGCC 


SEQ. ID. No. 70 


H198A-R 


GGCTGTCGATCTCGTCggcGAGGCGCTGGGCGTACG 


SEQ. ID. No. 71 


H214A 


GCGAACGCGGTCGCCACGgccGTGGAAGCACCGCTG 


SEQ. ID. No. 72 


H214A-R 


CAGCGGTGCTTCCACggcCGTCYGCGACCGCGTTCGC 


SEQ. ID. No. 73 



25 

Total of 10 mutants have been done, including H20A, H70A, H123A, H198A, 
H214A, H20/70A, H20/123A, H70/123A, H20/70/123A. 
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Example 10 - Expression of mutated hemAT-Hs 

The hemAT-Hs/pTOPO construction was used as initial plasmid for the 
5 subcloning of hemAT-Hs gene into different vectors. Three different expression systems 
were used. First, the pMAL system was used for expression in E. coli (Fusion protein 
expression system). EcoRl and BamHl restriction digestion sites were used for cloning 
the mutated hemAT-Hs gene into pMAL vector. The protein expressed in this system is a 
MBP HemAT-Hs fusion protein. All of the mutants have been cloned into pMAL, 
10 expressed successfully, purified and spectra have been done as well. 

Second, the pET system is also used for expression of the peptides in E. coli. 
Ndel and BamHl restriction digestion sites were used for subcloning hemAT-Hs into pET 
vector. 

Third, in order to study the physiological function of HemAT-Hs in its native 
1 5 host, it has to be expressed in halobacterial AhemAT-Hs strain, a strain that hemAT-Hs 
gene has been deleted from its genome. Ndel and Xbal were used to clone mutated 
hemAT-Hs gene into a halobacterial shuttle expression vector pKJ427. hemAT- 
ifr/pTOPO plasmid was digested with Ndel and Xbal, as well as the shuttle vector 
pKJ427. Digested vector and hemAT-Hs insert were purified from agarose gel by 
20 GeneClean kit and ligated with T4 ligase at 4°C. Ligation reaction was transformed into 
E. coli competent cells. Colonies were inoculated, the plasmids were extracted and 
checked by double digestion and PCR. The final construction was transformed into 
halobacterial hemAT-Hs deletion strain for over-expressing HemAT-Hs in H. salinarum 
(standard halobacterial transformation protocol was used). Cultures were checked for 
25 expression of HemAT-Hs by immunoblot using both HC23 antibody and HemAT-Hs 
specific antibody. The clone with highest expression of HemAT-Hs was used for 
physiological study. 

Example 11 — Construction of a C-terminal His-tag of hemAT-Hs 

30 

In order to purify HemAT-Hs protein from its native host Halobacterium 
salinarum C-terminal His-tag was constructed. A two-step PCR strategy was used. First, 
an Ndel top primer and 20 nucleotide C-terminal of hemAT-Hs gene plus sequence 
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encoding 6-histidine primer were used for amplification. Second, using first round PCR 
product as template, Ndel top primer and 6-Histidine + Stop codon bottom primer were 
used for PCR. The primer (including BamHIIXbal cutting sites) was used to amplify the 
hemAT-Hs gene plus histidine codon as well as stop Codon right after 6-his sequence. 
5 TOPO cloning was used for cloning the PCR products. NdellXbal were used for 

subcloning of hemAT-Hs-6-YHs-stoY> construction into shuttle vector pKJ427. The final 
construction plasmid was transformed into hemAT-Hs deletion strain. 

Example 12 - HemAT-/fc overexpression construction in H. salinarum 

10 

Expression of HemAT-/Zy in Halobacterium salinarum was created in the 
expression vector pKJ 427. pKJ 427 plasmid contains a fedox promotor with an 
mevinolin resistant gene. Ndel and Xbal restriction recognition sites were used to clone 
the hemAT-Hs gene into the pKJ427 vector. Top primer with Ndel cutting site and 

1 5 bottom primer with Xbal cutting site were designed and used for amplifying hemAT-Hs 
gene from Halobacterium salinarum genomic DNA by proof-reading pfu DNA 
polymerase. The PCR product was cloned into TOPO cloning vector (Invitrogen) and 
transformed into E. coli competent cells. The plasmid containing hemAT-Hs gene in 
TOPO vector was subcloned into pKJ 427 vector by NdellXbal double digestion. The 

20 hemAT-Hs/pKJ427 construction was confirmed by PCR as well as NdellXbal double 
digestion and then, the plasmid was transformed into tShtrVIIl deletion strain using 
standard transformation protocol. After two week incubation, colonies were picked up 
and grown in halobacterial growth medium. Each individual culture was checked by PCR 
to confirm the presence of the plasmid, and by immunoblot to confirm the expression 

25 level of HemAT-/fr. 

Example 13 - Expression of hemAT-Bs 

As with hemAT-Hs, three expression systems have been developed. First, hemAT- 
30 Bs is expressed in the pMAL expression system. In order to express hemAT-Bs encoding 
protein HemAT-ifa in E. coli, expression primers were needed to amplify the hemAT-Bs 
gene. Not only the gene, but the ribosomal binding region upstream of the start codon, is 
required for the expression of HemAT-ifa in E. coli. 
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Table 5: Primers for PCR of UemAT-BS. 



Name of Primer 


Sequence 


hemAT-Bs 
BamHltop 


ATATGGATCCAAGGGGGATCATTGTAATGTTATTTAAAAAAG 
SEQ. ID. No. 74 


hemAT-Bs 
Pstlbot 


ATTACTGCAGCAACTGATTTTTAATTTAAGTTTACATAATGAACGC 
SEQ. ID. No. 75 



5 BamHl and Pstl were selected for the cloning of hemAT-Bs into expression vector 

and E. coli-Bacillus subtilis shuttle vector. hemAT-BsBamHl top/Pstl bot primers were 
used to amplify the hemAT-Bs gene from Bacilus subtilis gemonic DNA by PCR with pfu 
DNA polymerase. After PCR, the amplicon was immediately cloned into TOPO vector 
using invitrogen TOPO Blunt Cloning kit and transformed into TOP 10 E. coli competent 

10 cells. Colonies were checked for the right insert. 

BamHl and Pstl were also used for the cloning of the HemAT-ifo into pMAL ell 
vector as well as the shuttle vector pEB 112. hemAT-Bs/pMAL construction was 
transformed to E. coli pLysS cells for the expression. After IPTG induction, SDS gel 
showed two bands in comparison to the uninduced sample. The top band is HemAT-ite 

1 5 protein. The spectra is checked and the results showed clearly the hemeprotein signature 
peaks while the MBP itself doesn't show any peak at 410 nm and 541nm/580nm.) 

Second, the Ndel top and BamHl bot primers were used for the cloning of hemAT- 
Bs gene into pET vector. The ribosomal binding region is also included in front of the 
gene. TOPO cloning was performed after PCR reaction and the construction was 

20 confirmed by Ndel/ BamHl digestion as well as PCR. hemAT-BslpET construction was 
transformed into E. coli pLysS competent cells. IPTG was used for the protein induction. 
Spectra showed the specific peaks for hemeprotein. 

A peptide consisting of the N-terminal 190 or 250 residues was expressed in 
pMAL vector. A bottom primer at position 190 and 250 amino acid residue were 

25 synthesized with a Pstl cutting site. hemAT-Bs BamHl top and these top primers were 
used to amplify the gene encoding 190 and 250 amino acids at N-terminal of hemAT-Bs. 
The PCR products were cloned into TOPO vector and then subcloned into pMAL 
HemAT-Ss 250 vector by using BamHIIPstl. 190 and 250 hemAT-Bs/pMAL 
constructions were confirmed and transformed into pLysS cells for expression. As 
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expected, other than MBP, a second protein band appears at position 25 and 30 kDa, 
which are the sizes of N-terminal 190 and 250 residues of HemAT-jS^ protein. Spectra 
also showed the signature peaks of hemeprotein. 

A shuttle vector is used for the expression of hemAT-Bs gene in its native host 
5 B.subtilis. hemAT-Bs ITOVO construction was used as initial plasmid. The hemAT- 
Bs/pEB 112 construction was transformed into AhemAT-Bs deletion strain. The 
transformant was used for physiological study of HemAT-ifa. 

Example 14 - Construction of a C-terminal His-tag HemAT -2?s 

10 

Two round of pfu PCR were performed to generate a C-terminal 6 His-tag to 
HemAT-ifo. The top primer and bottom primer with 6 Histidine codon plus stop codon 
were used for the first round PCR. The PCR product was cloned into TOPO vector and 
the resultant vector used for the second round PCR. In the case of pET, Ndel and BamHl 
15 sites were created to clone the insert into expression vector. In the case of pMAL, BamHl 
top and BamHl bot primer were used. The final constructions (pET/pMAL) were 
transformed to E. coli pLysS cells for induction. 

Example 15 - Site-Directed Mutagenesis of HemAT-Z?5 

20 

The same strategy is used for generating site-directed mutants for HemAT-i?.? of 
B. subtilis. The HemAT-2fa/TOPO construction with BamHl top and Pstl bottom 
restriction sites was used as template for PCR-based mutagenesis. HemAT-i?s H75A, 
H86A, H99A, H122A, H123A and H199A are being mutated by PCR-based mutagenesis. 
25 The HemAT-Ite/pTOPO plasmid was used as initial template for PCR. 

The mutants, H75A, H99A, and H123A, have been cloned into pMAL expression 
vector. H123A spectra showed no significant signature peaks at 540 nm and 580 nm. 
H123R from pMAL expression culture showed no hemeprotein signature spectra. 
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Example 16 -Carbon Monoxide Binding in HemAT-Hs and HemAT-i?s 

The rate of CO binding to both HemAT-Hs and HemAT-Bs was determined by 
5 transient absorption spectroscopy using instrumentation described previously (Larsen,et 
al., Inorg. Chim. Acta 234:101-107(1995), which is hereby incorporated by reference). 

The rates of CO dissociation for HemAT-i/s and HemAT-ifa were determined 
using the ferricyanide method (Gilles-Gonzalez, et al., Biochemistry 33:8067-8073 
(1994), which is hereby incorporated by reference). Changes in absorbance as a function 
1 0 of time at 41 8 ran (Soret maximum for the CO bound derivative of each protein) were 

monitored after the addition of potassium ferricyanide (final concentration of 1 .5 mM) to 
solutions of the co-bound protein. The resulting traces were then fit to single exponential 
decays to obtain koff assuming the following reaction: 

15 ^ Fe(III)(CN-)6,kl 

CO-Fe(II)HemAT-Xx < — > Fe(II)HemAT-Xx + CO > Fe(III)HemATs 

K on 

where koff/kon are the dissociation/association rate constants and kl is the rate of HemATs 
20 oxidation. This procedure relies on kl being much larger than koff. In the case of the 

HemATs proteins this was confirmed by measuring the rate of heme oxidation of the five- 
coordinate deoxy form the protein. 

The optical absorption spectrum of deoxy and CO-bound derivatives of HemAT- 
Hs and HemAT-^ are shown in Figure 4. The absorption spectra of the deoxy forms of 
25 both proteins are indicative of five-coordinate high-spin heme with Soret maxima at 425 
nm and a broad visible band centered at 555 nm. In the presence of CO the absorption 
spectrum resembles a six-coordinate low-spin heme with a Soret maximum at -418 nm 
(UemAT-Hs/Bs) and visible bands at 535 nm and 573 nm. 

Figure 6 displays typical transient absorption data subsequent to CO photolysis 
30 obtained at 430 nm at 25°C and 1 atm CO for both HemAT-Hs (solid line) and HemAT- 
Bs (dotted line). The data can be fit to a single exponential decay indicating a pseudo- 
first order reaction with CO. The resulting rate constant for CO recombination are found 
to be 30±3 s" 1 and 132±3 s" 1 for HemAT-Hs and HemAT-^, respectively. Figure 7 shows 
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the corresponding transient difference spectrum (25 [as subsequent to photolysis) overlaid 
with the equilibrium difference spectrum (deoxy minus CO-bound) for HemAT-Hs (top 
panel) and HemAT-fe (bottom panel). The red-shift in the transient difference spectra 
relative to the equilibrium difference spectra suggest that CO photolysis produces a non- 
5 equilibrium five-coordinate complex within 25 \is subsequent to photolysis. 

Figure 8 displays the CO-off rate data for HemAT-Hs, HemAT-i&, and horse 
heart Mb. The CO off-rates are found to be 0.2±0.01 s" 1 , 0.098±0.002 s" 1 , and 
0.056±0.001 s" 1 for HemAT-Hs, HemAT-ifo, and horse heart Mb, respectively. Using 
these values along with the second-order rate constants for CO recombination (scaling the 
10 pseudo first order rate constants to CO concentration) the associations constants for CO 
are found to be 1.5xl0 4 M" 1 , 1.35xl0 6 M" 1 , and 7.38x1 0 6 M" 1 for HemAT-Hs, HemAT-Bs, 
and horse heart Mb, respectively. These values along with literature values for CO 
binding to other heme proteins are provided in Table 6. 

1 5 Table 6: CO- Affinities of various heme proteins. 



Protein 


K(xlO"' t M" 1 ) 


KontxlO-'M'V 1 ) 


Koffts" 1 ) 


HemAT-/fr a 


15 


3 


0.2 


HemAT-Ss" 


135 


13.2 


0.098 


HHMb" 


738 


46.5 


0.06 


SWMb° 


2700 


51 


0.019 


SW Mb H(E7)->L° 


110,000 


2,600 


0.024 


Human HbA c 


50,000 


600 


0.013 


BjFixL c 


10 


0.5 


0.045 


RmFixLT c 




1.2 




RmFixLH c 


20 


1.7 


0.083 


HPvP (pH 7.0) c 


350 


0.3 


0.0001 


Aplaysia Mb c 


3,000 


50 


0.02 



a This work. 

20 b Springer, et al, Chem. Rev. 94:699-714 (1994), which is hereby incorporated by 
reference. 

c Gilles-Gonzalez, et al., Biochemistry 33:8067-8073 (1994), which is hereby 
incorporated by reference. 

25 The absorption spectra of oxy-, deoxy-, and carbon monoxide forms of HemAT- 

Hs and HemAT-Ss establish that both proteins have a heme prosthetic group to reversibly 
bind oxygen. Capillary assays demonstrate that both HemAT-/fc and HemAT-ifc are 
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involved in negative aerotaxis in phylogenetically distinct archaeon H. salinarum and 
gram-positive bacterium B. subtilis, respectively. Thus, the N-terminal segments of 
HemAT-/fc and HemAT-tfs may act as sensory domains by binding diatomic oxygen 
through the heme prosthetic group in the ferrous (Fe(II)) state. This oxygen binding 
5 triggers a conformational change in the sensor domain, which in turn alters the activity of 
the C-terminal signaling domain. This initiates association of the signaling domain with 
CheW and CheA proteins to generate signals that change the flagellar rotational bias. 

Current evolutionary reconstruction indicates that myoglobin, (a- and (3-globins 
derive from a protein that originally appeared in an ancient vertebrate about 500 million 

1 0 years ago (Hardison, Amer. Scientist 87: 126-137 (1999), which is hereby incorporated 
by reference). However, comparison of amino acid sequences in globins from Eukarya 
and Bacteria suggests they share a very early common ancestor, in spite of the fact that 
the proteins perform different functions (Hardison, Amer. Scientist , 87:126-137 (1999); 
Hardison, J. Exp. Biol. . 201:1 099- 1 1 1 7 (1 998), which are hereby incorporated by 

15 reference). The conserved residues among all myoglobins are the proximal histidine 

residue in the F helix (F8) and two phenylalanine residues in the CD region (CD1 packs 
against the heme and CD4 in a hydrophobic cluster in contact with the heme), the distal 
histidine residue in the E helix (E7) and a proline residue at the beginning of the C helix 
(C2, sharp turn between B and C helices) (Bashford et al., J. Mol. Biol. . 196:199-216 

20 (1987); Vinogradov et al., Comp. Biochem. Physiol. . 106B:l-26 (1993), which are hereby 
incorporated by reference). Three of these residues (proline in C2, phenylalanine in CD4 
and histidine in F8) are conserved and phenylalanine in CD1 is replaced by valine in 
HemAT-Hs and HemAT-ifo (marked with asterisks in Figure 1 A). 

HemAT proteins constitute a new class of sensors that differ significantly from the 

25 known heme-containing O2 -sensor FixL (1 6, 1 7). FixL is a member of the large family 
of sensor kinases ubiquitous in bacterial two-component regulatory systems. Its heme- 
binding domain belongs to the PAS-domain superfamily (18, 19). HemATs contain no 
PAS domains (Taylor, et al., Ann. Rev. Microbiol. . 53:90 (1999); Zhulin et al., Mol. 
Microbiol. . 29:1522 (1998), which are hereby incorporated by reference) and differ from 

30 FixL both in spectral features and physiological function (Gilles-Gonzalez, et al., Nature. 
350:170 (1991); Lois, et al.. J. Bacteriol. . 175:1103 (1993), which are hereby 
incorporated by reference). The absorption bands of HemATs are blue-shifted relative to 
FixL (415 nm Soret band), indicating that the proteins have distinct heme-pocket 
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geometries. In addition, both HemATs participate in aerotaxis, whereas FixL regulates 
transcription. HemATs also differ from the aerotaxis transducer Aer in E. coli, which has 
a FAD-binding PAS domain (Rebbapragada et al., Proc. Natl. Acad. Sci. U.S.A. , 
94:10541 (1997); Bibikov et al., J. BacterioL 179:4075 (1997), which are hereby 
5 incorporated by reference). 

The amino-terminal domains of HemATs are proposed to act as sensors by 
binding diatomic oxygen at their heme when it is in the ferrous (Fe [II]) state. Oxygen 
binding presumably triggers a conformational change in the sensor domain that, in turn, 
alters the activity of the carboxyl-terminal signaling domain. The carboxyl-terminal 

10 domains of HemATs are very similar to the signaling domains of the MCP family of 

bacterial chemoreceptors, which associate with the cytoplasmic CheW and CheA proteins 
to mediate chemotaxis. 

HemATs offer the possibility of being used as biological sensors to monitor 
physiologically important gases, such as 02 or CO, because: 1) they are soluble proteins 

15 like myoglobin, which has been widely studied at the molecular level; 2) they possess a 
signaling domain that resembles those of the molecularly well-characterized bacterial 
chemotaxis transducers; and 3) direct observation of the aerotactic response permits rapid 
analysis of various perturbations of the sensing and signaling system. In addition, these 
two proteins provide information about the evolutionary origins of globins in the 

20 Eucarya, Archaea, and Bacteria. 

Although preferred embodiments have been depicted and described in detail 
herein, it will be apparent to those skilled in the relevant art that various modifications, 
additions, substitutions, and the like can be made without departing from the spirit of the 
invention and these are therefore considered to be within the scope of the invention as 

25 defined in the claims which follow. 
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What is claimed is: 

1 . An isolated bacterial heme binding protein wherein said protein reversibly 
binds oxygen with a low affinity and wherein said a heme binding domain of said protein 

5 shows at least 20% identity to a myoglobin heme binding domain. 

2. The isolated heme-binding protein according to claim 1, wherein the 
protein comprises a heme binding domain and a signaling domain. 

10 3 . The isolated heme-binding protein according to claim 1 , wherein the 

protein is isolated from Archaea. 

4. The isolated heme-binding protein according to claim 3, wherein the 
protein is isolated from Halobacterium salinamm. 

15 

5. The isolated heme-binding protein according to claim 4, wherein the 
protein's activity is salt tolerant. 

6. The isolated heme-binding protein according to claim 1 , wherein the 
20 protein has an amino acid sequence of SEQ. ID. No. 2. 

7. The isolated heme-binding protein according to claim 1 , wherein the 
protein is isolated from Bacillus subtilis. 

25 8. The isolated heme-binding protein according to claim 7, wherein the 

protein has an amino acid sequence of SEQ. ID. No. 4. 

7. A fragment of the isolated heme-binding protein according to claim 1 , 
wherein said fragment comprises a heme-binding domain. 

30 

8. The fragment according to claim 4, further comprising a heterologous 
signal transduction domain. 
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9. A blood substitute comprising: 

a bacterial heme binding protein wherein said protein reversibly binds oxygen 
with a low affinity. 

5 10. The blood substitute according to claim 9, wherein the protein comprises a 

heme binding domain and a signaling domain. 

1 1 . The blood substitute according to claim 1 0, wherein the protein is isolated 
from Archaea. 

10 

12. The blood substitute according to claim 1 1, wherein the protein is isolated 
from Halobacterium salinarum. 

13. The blood substitute according to claim 12, wherein the protein's activity 
15 is salt tolerant. 

14. The blood substitute according to claim 9, wherein the protein has an 
amino acid sequence of SEQ. ID. No. 2. 

20 15. The blood substitute according to claim 9, wherein the protein is isolated 

from Bacillus subtilis. 

16. The blood substitute according to claim 15, wherein the protein has an 
amino acid sequence of SEQ. ID. No. 4. 

25 

17. The blood substitute according to claim 15, comprising a fragment of the 
isolated heme-binding protein having a heme-binding domain. 



18. The blood substitute according to claim 17, further comprising a 
30 heterologous signal transduction domain. 
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1 9. A method of treating a patient suffering from low blood levels comprising: 
administering to the patient a blood substitute according to claim 9. 

20. The method according to claim 19, further comprising: 

5 regulating the oxygen binding of the heme-binding protein by modifying the signaling 
domain. 

21 . A method for controlled storage of oxygen, comprising: 

providing a bacterial heme binding protein wherein said protein reversibly binds 
1 0 oxygen with a low affinity; and 

contacting said protein with oxygen allowing the protein to bind and store oxygen. 

22. The method according to claim 21 , further comprising: 
triggering the release of oxygen from the protein by activating the signaling 

15 domain. 

23 . The method according to claim 21 , wherein the protein comprises a heme 
binding domain and a signaling domain. 

20 24. The method according to claim 21 , wherein the protein is isolated from 

Archaea. 

25. The method according to claim 24, wherein the protein is isolated from 
Halobacterium salinarum. 

25 

26. The method according to claim 25, wherein the protein's activity is salt 
tolerant. 

27. The method according to claim 26, wherein the protein has an amino acid 
30 sequence of SEQ. ID. No. 2. 

28. The method according to claim 21 , wherein the protein is isolated from 
Bacillus subtilis. 
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29. The method according to claim 28, wherein the protein has an amino acid 
sequence of SEQ. ID. No. 4. 

5 30. The method according to claim 2 1 , wherein the protein is a fragment of an 

isolated bacterial heme binding protein which reversibly binds oxygen with a low affinity, 
wherein said fragment comprises a heme-binding domain. 

3 1 . The method according to claim 30, wherein the fragment further 
10 comprising a heterologous signal transduction domain. 

32. A method of sensing gaseous ligands comprising: 

providing a heme binding bacterial protein wherein said protein reversibly binds 
oxygen with a low affinity; 
1 5 exposing said protein to a sample to be tested; and 

measuring a change in the conformation of the protein. 

33 . The method according to claim 32, wherein said measuring is carried out 
optically. 

20 

34. The method according to claim 32, wherein said measuring is carried out 
electronically. 

35. The method according to claim 32, wherein the gaseous ligand is selected 
25 from the group consisting of 0 2 , NO, CO, and CN. 

36. The method according to claim 32, wherein the gaseous ligand is 0 2 . 

37. The method according to claim 32, wherein the protein comprises a heme 
30 binding domain and a signaling domain. 

38. The method according to claim 32, wherein the protein is isolated from 
Archaea. 
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39. The method according to claim 38, wherein the protein is isolated from 
Halobacterium salinarum. 

40. The method according to claim 39, wherein the protein's activity is salt 
tolerant. 

41 . The method according to claim 40, wherein the protein has an amino acid 
sequence of SEQ. ID. No. 2. 

42. The method according to claim 32, wherein the protein is isolated from 
Bacillus subtilis. 

43 . The method according to claim 42, wherein the protein has an amino acid 
sequence of SEQ. ID. No. 4. 

44. The method according to claim 32, wherein the protein is a fragment of an 
isolated bacterial heme binding protein which reversibly binds oxygen with a low affinity, 
wherein said fragment comprises a heme-binding domain. 

45. The method according to claim 44, wherein the fragment further 
comprising a heterologous signal transduction domain. 

46. A chimeric protein comprising: 

a heme-binding domain of an isolated heme binding bacterial protein; and 
a heterologous signaling domain. 

47. The chimeric protein according to claim 46, wherein the heterologous 
signaling domain is a mutated signaling domain having altered affinity for its ligand. 

48. The isolated heme-binding protein according to claim 47, wherein the 
protein comprises a heme binding domain and a signaling domain. 
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49. The isolated heme-binding protein according to claim 47, wherein the 
protein is isolated from Archaea. 

50. The isolated heme-binding protein according to claim 49, wherein the 
5 protein is isolated from Halobacterium salinarum. 

5 1 . The isolated heme-binding protein according to claim 50, wherein the 
protein's activity is salt tolerant. 

10 52. The isolated heme-binding protein according to claim 5 1 , wherein the 

protein has an amino acid sequence of SEQ. ID. No. 2. 

53. The isolated heme-binding protein according to claim 47, wherein the 
protein is isolated from Bacillus subtilis. 

15 

54. The isolated heme-binding protein according to claim 47, wherein the 
protein has an amino acid sequence of SEQ. ID. No. 4. 

55 . An isolated nucleic acid molecule wherein the nucleic acid molecule 
20 encodes a heme binding bacterial protein wherein said protein reversibly binds oxygen 

with a low affinity. 

56. The isolated nucleic acid molecule according to claim 55, wherein the 
nucleic acid molecule comprises: 

25 a nucleotide sequence as shown in SEQ. ID. No. 1; or 

a nucleotide sequence which hybridizes to a nucleic acid molecule having the 
sequence shown in SEQ. ID. No. 1 under stringent conditions. 

57. A vector comprising the nucleic acid molecule according to claim 55. 

30 

58. A host cell transformed with the vector according to claim 57. 
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59. The isolated nucleic acid molecule according to claim 55, wherein the 
nucleic acid molecule comprises: 

a nucleotide sequence as shown in SEQ. ID. No. 3; or 

a nucleotide sequence which hybridizes to a nucleic acid molecule having the 
5 sequence shown in SEQ. ID. No. 3 under stringent conditions. 

60. A vector comprising the nucleic acid molecule according to claim 59. 



61 . A host cell transformed with the vector according to claim 60. 

10 
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ABSTRACT OF THE INVENTION 

The present invention provides an isolated archael and bacterial heme binding 
5 protein which reversibly binds oxygen with a low affinity. The heme binding protein 

may be utilized as a blood substitute. The invention also provides a method for controlled 
storage of oxygen by contacting a bacterial heme binding protein with oxygen allowing 
the protein to bind and store oxygen. The also provides methods to sense gaseous ligands 
using the heme binding protein. In other embodiments, the invention provides chimeric 
1 0 proteins having a heme-binding domain of an isolated heme binding archael bacterial 
protein and a heterologous signaling domain. 




FIGURE 1 




FIGURE 2 



Mvoelobin-like Protein TMbLP) 



GQ DVLWLIKXHPLIQEKIXXFDFFKH | XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX | AQRXRLAQIHAXKGKIPDWYL | 

Ml-box M2-box 
SEQ. ID. No. 82 SEQ. ID. No. 83 



TEMPLATE 



IIKXTVPVLXEHGXXI GQDVLWLIKXNPEIQEKFFFFKH XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX | AQRXRL&QIHAXKGKIPDWYL | 



H-box Ml-box M2-box 

SEQ. ID. No. 84 SEQ. ID. No. 85 SEQ. ID. No. 

Figure 1 The two sequences used in the analyses. Ml and M2 are the site of myoglobin recognition. M2 is the 
site of Hem AT recognition. The H-box is the site primarily of microbial hemoglobin recognition. 
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